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5 METHODS AND PRODUCTS FOR ANALYZING POLYMERS 

Background 

The study of molecular and cellular biology is focused on the macroscopic structure of cells. 
We now know that cells have a complex microstructure that determine the functionality of the cell. 

10 Much of the diversity associated with cellular structure and function is due to the ability of a cell to 
assemble various building blocks into diverse chemical compounds. The cell accomplishes this task 
by assembling polymers from a limited set of building blocks referred to as monomers. The key to 
the diverse functionality of polymers is based in the primary sequence of the monomers within the 
polymer and is integral to understanding the basis for cellular function, such as why a cell 

15 differentiates in a particular manner or how a cell will respond to treatment with a particular drug. 

The ability to identify the structure of polymers by identifying their sequence of monomers 
is integral to the understanding of each active component and the role that component plays within 
a cell. By determining the sequences of polymers it is possible to generate expression maps, to 
determine what proteins are expressed, to understand where mutations occur in a disease state, and 

20 to determine whether a polysaccharide has better function or loses function when a particular 
monomer is absent or mutated. 

Expression maps relate to determining mRNA expression patterns. The need to identify 
differentially expressed mRNAs is critical in the understanding of genetic programming, both 
temporally and spatially. Different genes are turned on and off during the temporal course of an 

25 organisms' life development, comprising embryonic, growth, and aging stages. In addition to 
developmental changes, there are also temporal changes in response to varying stimuli such as 
injury, drugs, foreign bodies, and stress. The ability to chart expression changes for specific sets of 
cells in time either in response to stimuli or in growth allows the generation of what are called 
temporal expression maps. On the other hand, there are also body expression maps, which include 

30 knowledge of differentially expressed genes for different tissues and cell types. Expression maps 
are different not only between species and between individuals, but also between diseased and 
disease-free states. Examination of differential gene expression has yielded key discoveries of genes 
in widely varying disciplines, such as signal transduction (Smith et al., 1990), circadian rhythms 
(Loros et al., 1989), fruit ripening (Wilinson et al., 1995), hunger (Qu et al., 1996), cell cycle control 




PCT/US98/03024 



PNSOOCID- <WO 



P935012A2 I > 




« .• 

WO 98/35012 PCT/US98/03024 

-2- 

(el-Deiry et al., 1 993), apoptosis (Woronicz et al., 1 994), and ischemic injury (Wang et al., 1 995), 
among many others. Since generation of expression maps involve the sequencing and identification 
of cDNA or mRNA, more rapid sequencing necessarily means more rapid generation of multiple 
expression maps. 

5 Currently, only 1 % of the human genome and an even smaller amount of other genomes have 

been sequenced. In addition, only one very incomplete human body expression map using expressed 
sequence tags has been achieved (Adams et al., 1995). Current protocols for genomic sequencing 
are slow and involve laborious steps such as cloning, generation of genomic libraries, colony 
picking, and sequencing. The time to create even one partial genomic library is on the order of 

10 several months. Even after the establishment of libraries, there are time lags in the preparation of 
DNA for sequencing and the running of actual sequencing steps. Given the multiplicative effect of 
these unfavorable facts, it is evident that the sequencing of even one genome requires an enormous 
investment of money, time, and effort. 

In general DNA sequencing is currently performed using one of two methods. The first and 

15 more popular method is the dideoxy chain termination method described by Sanger et al. (1977). 
This method involves the enzymatic synthesis of DNA molecules terminating in dideoxynucleotides. 
By using the four ddNTPs, a population of molecules terminating at each position of the target DNA 
can be synthesized. Subsequent analysis yields information on the length of the DNA molecules and 
the base at which each molecule terminates (either A, C, G, or T). With this information, the DNA 

20 sequence can be determined. The second method is Maxam and Gilbert sequencing (Maxam and 
Gilbert, 1977), which yses chemical degradation to generate a population of molecules degraded at 
certain positions of the target DNA. With knowledge of the cleavage specificities of the chemical 
reactions and the lengths of the fragments, the DNA sequence is generated. Both methods rely on 
polyacrylamide gel electrophoresis and photographic visualization of the radioactive DNA 

25 fragments. Each process takes about 1-3 days. The Sanger sequencing reactions can only generate 
300-800 bases in one run. 

Methods to improve the output of sequence information using the Sanger method also have 
been proposed. These Sanger-based methods include multiplex sequencing, capillary gel 
electrophoresis, and automated gel electrophoresis. Recently, there has also been increasing interest 

30 in developing Sanger independent methods as well. Sanger independent methods use a completely 
different methodology to realize the base information. This category contains the most novel 
techniques, which include scanning electron microscopy (STM), mass spectrometry, enzymatic 
luminometric inorganic pyrophosphate detection assay (ELIDA) sequencing, exonuclease 
sequencing, and sequencing by hybridization. A brief summary of these methods is set forth below. 

35 Currently, automated gel electrophoresis is the most widely used method of large-scale 
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sequencing. Automation requires reading of fluorescently labeled Sanger fragments in real time with 
a charge coupled device (CCD) detector. The four different dideoxy chain termination reactions are 
run with different labeled primers. The reaction mixtures are combined and co-electrophoresed 
down a slab of polyacrylamide. Using laser excitation at the end of the gel, the separated DNA 

5 fragments are resolved and the sequence determined by computer. Many automated machines are 
available commercially, each employing different detection methods and labeling schemes. The most 
efficient of these is the Applied Biosystems Model 377XL, which generates a maximum actual rate 
of 1 15,200 bases per day. 

In the method of capillary gel-electrophoresis, reaction samples are analyzed by small 

10 diameter, gel-filled capillaries. The small diameter of the capillaries (50 (im) allows for efficient 
dissipation of heat generated during electrophoresis. Thus, high field strengths can be used without 
excessive Joule heating (400 V/m), lowering the separation time to about 20 minutes per reaction 
run. Not only are the bases separated more rapidly, there is also increased resolution over 
conventional gel electrophoresis. Furthermore, many capillaries are analyzed in parallel (Wooley 

15 and Mathies, 1995), allowing amplification of base information generated (actual rate is equal to 
200,000 bases/day). The main drawback is that there is not continuous loading of the capillaries 
since a new gel-filled capillary tube must be prepared for each reaction. Capillary gel 
electrophoresis machines have recently been commercialized. 

Multiplex sequencing is a method which more efficiently uses electrophoretic gels (Church 

20 and Kieffer-Higgins, 1988). Sanger reaction samples are first tagged with unique oligomers and then 
up to 20 different samples are run on one lane of the electrophoretic gel. The samples are then 
blotted onto a membrane. The membrane is then sequentially probed with oligomers that correspond 
to the tags on the Sanger reaction samples. The membrane is washed and reprobed successively until 
the sequences of all 20 samples are determined. Even though there is a substantial reduction in the 

25 number of gels run, the washing and hybridizing' steps are as equally laborious as running 
electrophoretic gels. The actual sequencing rate is comparable to that of automated gel 
electrophoresis. 

Sequencing by mass spectrometry was first introduced in the late 80' s. Recent developments 
in the field have allowed for better sequence determination (Crain, 1990; Little et al., 1994; Keough 

30 et al., 1993; Smirnov et al., 1996). Mass spectrometry sequencing first entails creating a population 
of nested DNA molecules that differ in length by one base. Subsequent analysis of the fragments is 
performed by mass spectrometry. In one example, an exonuclease is used to partially digest a 33- 
mer (Smirnov, 1996). A population of molecules with similar 5' ends and varying points of 3' 
termination is generated. The reaction mixture is then analyzed. The mass spectrometer is sensitive 

35 enough to distinguish mass differences between successive fragments, allowing sequence 
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information to be generated. 

Mass spectrometry sequencing is highly accurate, inexpensive, and rapid compared to 
conventional methods. The major limitation, however, is that the read length is on the order of tens 
of bases. Even the best method, matrix-assisted laser desorption ionization time-of-flight (MALDI- 
5 TOF) mass spectroscopy (Smirnov et aL, 1996), can only achieve maximum read lengths of 80-90 
base pairs. Much longer read lengths are physically impossible due to fragmentation of longer DNA 
at guanidines during the analysis step. Mass spectrometry sequencing is thus limited to verifying 
short primer sequences and has no practical application in large-scale sequencing. 

The Scanning tunneling microscope (STM) sequencing (Ferrell, 1997) method was 

10 conceived at the time the STM was commercially available. The initial promise of being able to read 
base-pair information directly from the electron micrographs no longer holds true. DNA molecules 
must be placed on conducting surfaces, which are usually highly ordered pyrolytic graphite (HOPG) 
or gold. These lack the binding sites to hold DNA strongly enough to resist removal by the physical 
and electronic forces exerted by the tunneling tip. With difficulty, DNA molecules can be 

15 electrostatically adhered to the surfaces. Even with successful immobilization of the DNA, it is 
difficult to distinguish base information because of the extremely high resolutions needed. With 
current technology, purines can be distinguished from pyrimidines, but the individual purines and 
pyrimidines cannot be identified. The ability to achieve this feat requires electron microscopy to be 
able to distinguish between aldehyde and amine groups on the purines and the presence or absence 

20 of methyl groups on the pyrimidines. 

Enzymatic luminometric inorganic pyrophosphate detection assay (ELIDA) sequencing uses 
the detection of pyrophosphate release from DNA polymerization to determine the addition of 
successive bases. The pyrophosphate released by the DNA polymerization reaction is converted to 
ATP by ATP sulfurylase and the ATP production is monitored continuously by firefly luciferase. 

25 To determine base specificity, the method uses successive washes of ATP, CTP, GTP, and TTP. If 
a wash for ATP generates pyrophosphate, one or more adenines are incorporated. The number of 
incorporated bases is directly proportional to the amount of pyrophosphate generated. Enhancement 
of generated sequence information can be accomplished with parallel analysis of many ELIDA 
reactions simultaneously. 

30 The main disadvantage is the short read length. Ronaghi et aL (1996) have only achieved 

a maximum read length of 15 bases because of the multiple washings needed. Since there are four 
washes per base read, this means that a total of 400 washes mush be performed for a read length of 
a hundred bases. If there is even 1% loss of starting material for each wash, after 400 washes there 
would be 1.8% of the starting material remaining, which is insufficient for detection. 

35 Exonuclease sequencing involves a fluorescently labeled, single-stranded DNA molecule 
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which is suspended in a flowing stream and sequentially cleaved by an exonuclease. Individual 
fluorescent bases are then released and passed through a single molecule detection system. The 
temporal sequence of labeled nucleotide detection corresponds to the sequence of the DNA 
(Ambrose et al., 1993; Davis et al., 1992; Jett et al., 1989). Using a processive exonuclease, it 

5 theoretically is possible to sequence 10,000 bp or larger fragments at a rate of 1 0 bases per second. 

In practice, exonuclease sequencing has encountered many difficulties in each of the steps. 
The labeling step requires that all four bases in the DNA be tagged with different fluorophores. 
Sterically, this is extremely unfavorable. Ambrose et al., 1993 has achieved complete labeling of 
two bases on a 7 kb strand of M13 DNA. Furthermore, difficult optical trapping is needed to 

10 suspend DNA molecules in a flowing stream. The step is time intensive and requires considerable 
expertise. Lastly, single molecules of fluorophore need to be detected with high efficiency. Even 
a 1% error is significant. Improvements in detection from 65% to 95% efficiency have been 
achieved. The efficiency of detection has been pushed to the limit and it would be difficult to 
achieve further improvements. 

15 In the sequencing by hybridization method, a target DNA is sequentially probed with a set 

of oligomers consisting of all the possible oligomer sequences. The sequence of the target DNA is 
generated with knowledge of the hybridization patterns between the oligomers and the target (Bains, 
1991; Cantor et al., 1992; Drmanac et al., 1994). There are two possible methods of probing target 
DNA. The "Probe Up" method includes immobilizing the target DNA on a substrate and probing 

20 successively with a set of oligomers. "Probe Down" on the other hand requires that a set of 
oligomers be immobilized on a substrate and hybridized with the target DNA. With the advent of 
the "DNA chip," which applies microchip synthesis techniques to DNA probes, arrays of thousands 
of different DNA probes can be generated on a 1 cm 2 area, making Probe Down methods more 
practical. Probe Up methods would require, for an 8-mer, 65,536 successive probes and washings, 

25 which would take an enormous amount of time. Oh the other hand, Probe Down hybridization 
generates data in a few seconds. With perfect hybridization, 65,536 octamer probes would determine 
a maximum of 1 70 bases. With 65,536 "mixed" 1 1 -mers, 700 bases can be generated. 

In practice, Probe Up methods have been used to generate sequences of about 100 base pairs. 
Imperfect hybridization has led to difficulties in generating adequate sequence. Error in 

30 hybridization is amplified many times. A 1% error rate reduces the maximum length that can be 
sequenced by at least 10%. Thus if 1% of 65,536 oligonucleotides gave false positive hybridization 
signals when hybridizing to a 200-mer DNA target, 75% of the scored "hybridizations" would be 
false (Bains, 1997). Sequence determination would be impossible in such an instance. The 
conclusion is that hybridization must be extremely effective in order to generate reasonable data. 

35 Furthermore, sequencing by hybridization also encounters problems when there are repeats in 
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sequences that are one base less than the length of the probe. When such sequences are present, 
multiple possible sequences are compatible with the hybridization data. 

The most common limitation of most of these techniques is a short read length. In practice 
a short read length means that additional genetic sequence information needs to be sequenced before 
5 the linear order of a target DNA can be deciphered. The short fragments have to be bridged together 
with additional overlapping fragments. Theoretically, with a 500 base read length, a minimum of 
9 x 10 9 bases need to be sequenced before the linear sequence of all 3 x 10 9 bases of the human 
genome are properly ordered. In reality, the number of bases needed to generate a believable 
genome is approximately 2 x 10 10 bases. Comparisons of the different techniques show that only 

10 the impractical exonuclease sequencing has the theoretical capability of long read lengths. The other 
methods have short theoretical read lengths and even shorter realistic read lengths. To reduce the 
number of bases that need to be sequenced, it is clear that the read length must be improved. 

Protein sequencing generally involves chemically induced sequential removal and 
identification of the terminal amino acid residue, e.g., by Edman degradation. See Stryer. L., 

15 Biochemistry, W. H. Freeman and Co., San Francisco (1981) pp. 24-27. Edman degradation 
requires that the polypeptide have a free amino group which is reacted with an isothiocyanate. The 
isothiocyanate is typically phenyl isothiocyanate. The adduct intramolecularly reacts with the nearest 
backbone amide group of the polymer thereby forming a five membered ring. This adduct rearranges 
and the terminal amino acid residue is then cleaved using strong acid. The released 

20 phenylthiohydantoin (PTH) of the amino acid is identified and the shortened polymer can undergo 
repeated cycles of degradation and analysis. 

Further, several new methods have been described for carboxy terminal sequencing of 
polypeptides. See Inglis, A. S., Anal. Biochem. 195:183-96(1991). Carboxy terminal sequencing 
methods mimic Edman degradation but involve sequential degradation from the opposite end of the 

25 polymer. See Inglis, A. S., Anal. Biochem. 195:183-96 (1991). Like Edman degradation, the 
carboxy-terminal sequencing methods involve chemically induced sequential removal and 
identification of the terminal amino acid residue. 

More recently, polypeptide sequencing has been described by preparing a nested set 
(sequence defining set) of polymer fragments followed by mass analysis. See Chait, B. T. et al., 

30 Science 257:1885-94 (1992). Sequence is determined by comparing the relative mass difference 
between fragments with the known masses of the amino acid residues. Though formation of a nested 
(sequence defining) set of polymer fragments is a requirement of DNA sequencing, this method 
differs substantially from the conventional protein sequencing method consisting of sequential 
removal and identification of each residue. Although this method has potential in practice it has 

35 encountered several problems and has not been demonstrated to be an effective method. 
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Each of the known methods for sequencing polymers has drawbacks. For instance most of 
the methods are slow and labor intensive. The gel based DNA sequencing methods require 
approximately 1 to 3 days to identify the sequence of 300-800 units of a polymer. Methods such as 
mass spectroscopy and ELIDA sequencing can only be performed on very short polymers. 
5 An enormous need exists for de noveau polymer sequence determination. The rate of 

sequencing has limited the capability to generate multiple body and temporal expression maps which 
would undoubtedly aid the rapid determination of complex genetic function. A need also exists for 
improved methods for analyzing polymers in order to speed up the rate at which diagnosis of 
diseases and preparation of new medicines is carried out. 

10 

Summary of the Invention 

The invention relates to new methods and products for analyzing polymers and in particular 
new methods and products useful for determining the sequence of polymers. The invention has 
surprising advantages over prior art methods used to sequence polymers. Prior to the present 

1 5 invention no method or combination of methods has come close to achieving the rate of sequencing 
which the instant invention is capable of achieving. Using the methods of the invention the entire 
human genome can be sequenced several orders of magnitude faster than could be accomplished 
using conventional technology. In addition to sequencing the entire genome, the methods and 
products of the invention can be used to create comprehensive and multiple expression maps for 

20 developmental and disease processes. The ability to sequence an individual's genome and to 
generate multiple expression maps will greatly enhance the ability to determine the genetic basis of 
any phenotypic trait or disease process. 

The method for analyzing polymers according to the invention is based on the ability to 
examine each unit of a polymer individually. By examining each unit individually the type of unit 

25 and the position of the unit on the backbone of the polymer can be identified. This can be 
accomplished by positioning a unit at a station and examining a change which occurs when that unit 
is proximate to the station. The change can arise as a result of an interaction that occurs between the 
unit and the station or a partner and is specific for the particular unit. For instance if the polymer 
is a nucleic acid molecule and a T is positioned in proximity to a station a change which is specific 

30 for a T occurs. If on the other hand, a G is positioned in proximity to a station then a change which 
is specific for a G will occur. The specific change which occurs depends on the station used and the 
type of polymer being studied. For instance the change may be an electromagnetic signal which 
arises as a result of the interaction. 

The methods of the invention broadly encompass two types of methods for analyzing 

35 polymers by identifying a unit (or in some cases a group of units) within a polymer. The first type 
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of method involves the analysis of at least a single polymer. An individual unit of the single 
polymer in one aspect is caused to interact with an agent such that a change, e.g., energy transfer or 
quenching occurs and produces a signal. The signal is indicative of the identity of the unit. In 
another aspect an individual unit is exposed to a station resulting in a detectable physical change to 
5 the unit or station. The change in the unit or station produces a signal which can be detected and is 
characteristic of that particular unit. The second type of method involves the analysis of a plurality 
of polymers. A unit of each of the plurality of polymers is positioned at a station where an 
interaction can occur. The interaction is one which produces a polymer dependent impulse that 
specifically identifies the unit. The polymer dependent impulse may arise from, for example, energy 
10 transfer, quenching, changes in conductance, mechanical changes, resistance changes, or any other 
physical change. 

The proposed method for analyzing polymers is particularly useful for determining the 
sequence of units within a DNA molecule and can eliminate the need for generating genomic 
libraries, cloning, and colony picking, all of which constitute lengthy pre-sequencing steps that are 

15 major limitations in current genomic-scale sequencing protocols. The methods disclosed herein 
provide much longer read lengths than achieved by the prior art and a million-fold faster sequence 
reading. The proposed read length is on the order of several hundred thousand nucleotides. This 
translates into significantly less need for overlapping and redundant sequences, lowering the real 
amount of DNA that needs to be sequenced before genome reconstruction is possible. 

20 Methods for preparing polymers for analysis are also claimed herein. The combination of 

the long read length and the novel preparation methods results in a much more stream-lined and 
efficient process. Lastly, the actual time taken to read a given number of units of a polymer is a 
million-fold more rapid than current methods because of the tremendous parallel amplification 
supplied by a novel apparatus also claimed herein, which is referred to as a nanochannel plate or a 

25 microchannel plate. The combination of all these factors translates into a method of polymer 
analysis including sequencing that will provide enormous advances in the field of molecular and cell 
biology. 

The ability to sequence polymers such as genomic DNA by the methods described in the 
instant invention will have tremendous implications in the biomedical sciences. The recovery of 

30 genetic data at such a rapid pace will advance the Human Genome Project. The methods and 
products of the invention will allow the capability to prepare multiple expression maps for each 
individual, allowing complete human genetic programs to be deciphered. The ability to compare 
pools of individual genetic data at one time will allow, for the first time, the ability to discover not 
only single gene diseases with ease, but also complex multigene disorders as rapidly as the DNA 

35 itself is sequenced. 
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In one aspect the invention is a method for analyzing a polymer of linked units. The method 
involves the steps of exposing a plurality of individual units of a polymer to an agent selected from 
the group consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
source, individual units interacting with the agent to produce a detectable signal, and detecting 
5 sequentially the signals resulting from said interaction to analyze the polymer. In one embodiment 
the signal is electromagnetic radiation. In another embodiment the agent is electromagnetic 
radiation. According to an embodiment of the invention individual units of the polymer are labeled 
with a fluorophore. 

The plurality of individual units of the polymer may be sequentially exposed to 

10 electromagnetic radiation by bringing the plurality of individual units in proximity to a light 
emissive compound and exposing the light emissive compound to electromagnetic radiation, and 
wherein the plurality of individual units of the polymer detectably affect emission of electromagnetic 
radiation from the light emissive compound. In another embodiment the plurality of individual units 
of the polymer are sequentially exposed to electromagnetic radiation, and wherein the 

15 electromagnetic radiation detectably affects emission of electromagnetic radiation from the plurality 
of individual units of the polymer to produce the detectable signal. 

According to another embodiment of the invention the method involves the step of moving 
the polymer through a nanochannel in a wall material in order to locate the detectable signal. The 
plurality of individual units of the polymer are sequentially exposed to the agent by moving the 

:u polymer through a nanochannel in a wall material and exposing the plurality of individual units of 
the polymer to the agent at an interaction station at the nanochannel. The agent can be attached to 
(embedded in, covalently attached to the surface of or coated on the surface of) the wall material. 
In one embodiment the wall material includes a plurality of nanochannels, an interaction station at 
each nanochannel, and a plurality of polymers is moved through said nanochannel, only one polymer 

25 passes the interaction station at any given time (more than one polymer may be in a single 
nanochannel at a given time as long as they do not overlap), and signals resulting from the 
interaction of individual units of the polymers and the agent at the interaction stations are detected 
simultaneously. Preferably the nanochannel is fixed in the wall material. 

The signals which are detected can be stored in a database for further analysis. In one 

30 method of analysis these signals can be compared to a pattern of signals from another polymer to 
determine the relatedness of the two polymers. Alternatively the detected signals can be compared 
to a known pattern of signals characteristic of a known polymer to determine the relatedness of the 
polymer being analyzed to the known polymer. The analysis may also involve measuring the length 
of time elapsed between detection of a first signal from the first unit and a second signal from a 

35 second unit. In one embodiment the plurality of individual units are two units, a first unit at a first 
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end of the polymer and a second unit at an opposite second end of the polymer. The time elapsed 
between the sequential detection of signals may indicate the distance between two units or the length 
of the polymer. 

The polymer may be any type of polymer known in the art. In a preferred embodiment the 
5 polymer is selected from the group consisting of a nucleic acid and a protein. In a more preferred 
embodiment the polymer is a nucleic acid. 

The units of the polymer which interact with the agent to produce a signal are labeled. The 
units may be intrinsically labeled or extrinsically labeled. In one embodiment only a portion of the 
units of the polymer are labeled. In another embodiment all of the units are labeled. In yet another 
10 embodiment at least two units of the polymer are labeled differently so as to produce two different 
detectable signals. The units of the polymer may be labeled such that each unit or a specified portion 
of the units is labeled or it may be randomly labeled. 

In another embodiment the plurality of individual units of the polymer are exposed to at least 
two stations positioned in distinct regions of the channel, wherein the interaction between the units 
15 of the polymer and the at least two stations produces at least two signals. 

In one embodiment the individual unit of the polymer is labeled with radiation and the signal 
is electromagnetic radiation in the form of fluorescence. 

In another embodiment the unit is exposed to the agent at a station. Preferably the station 
is a non-liquid material. 

20 In yet another embodiment the plurality of individual units of the polymer are exposed to at 

least two agents and the interaction between the units of the polymer and the at least two agents 
produces at least two signals. The at least two agents may be positioned in distinct regions of a 
channel through which the polymer passes. In one embodiment the at least two signals are different 
signals. In another embodiment the at least two signals are the same signals. 

25 According to another aspect of the invention* a method for analyzing a polymer of linked 

units is provided. The method involves the steps of moving a plurality of individual units of a 
polymer of linked units with respect to a station and detecting sequentially signals arising from a 
detectable physical change in the polymer or the station as individual units pass the station to analyze 
the polymer. This aspect of the invention also encompasses each of the embodiments discussed 

30 above. 

In one embodiment the station is an interaction station and the individual units are exposed 
at the interaction station to an agent that interacts with the individual unit to produce a detectable 
electromagnetic radiation signal characteristic of the interaction. In another embodiment the station 
is a signal generation station and the characteristic signal produced is a polymer dependent impulse. 
35 Preferably the station is a non-liquid material. 
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In another aspect the invention is a method for analyzing a polymer of linked units by 
exposing a plurality of individual units of a polymer to a station to produce to produce a non-ion 
conductance signal resulting from the exposure of the units of the polymer to the station, and 
wherein the station is attached to a wall material having a surface defining a channel. This aspect 
5 of the invention also encompasses each of the embodiments discussed above. 

According to another aspect of the invention a method for identifying an individual unit of 
a polymer is provided. The method involves the steps of transiently exposing the individual unit of 
the polymer to an agent selected from the group consisting of electromagnetic radiation, a quenching 
source and a fluorescence excitation source, the identity of the individual unit being unknown, to 
10 generate an interaction with a detectable electromagnetic radiation signal characteristic of said 
individual unit, detecting said signal, and distinguishing said signal from signals generated from 
adjacent signal generating units of the polymer as an indication of the identity of the individual unit. 

The agent can be one or more fluorophores and preferably is at least three fluorophores. 
When the individual unit is transiently exposed to one or more fluorophores (agent) by positioning 
15 the individual unit within energy transfer proximity of the agent, fluorescence energy transfer occurs 
between the agent and the individual unit. The signal is detected by detecting the fluorescence 
energy transfer. 

In one embodiment the individual unit of the polymer is exposed to the agent by positioning 
the individual unit at an interaction station comprising a nanochannel in a wall material. Preferably 
20 the wall material comprises at least two layers, one of the layers allowing signal generation and the 
other preventing signal generation and the nanochannel traverses both layers. 

According to another aspect the invention is a method for identifying an individual unit of 
a polymer. The method includes the steps of transiently moving the individual unit of the polymer 
relative to a station, the identity of the individual unit being unknown, detecting a signal arising from 
25 a detectable physical change in the unit or the station, and distinguishing said signal from signals 
arising from exposure of adjacent signal generating units of the polymer to the station as an 
indication of the identity of the individual unit. This aspect of the invention also encompasses each 
of the embodiments discussed above. 

In one embodiment the station is an interaction station and the individual units are exposed 
30 at the interaction station to an agent that interacts with the individual unit to produce a detectable 
electromagnetic radiation signal characteristic of the interaction. In another embodiment the station 
is a signal generation station and the characteristic signal produced is a polymer dependent impulse. 

In yet another aspect the invention is a method for determining the proximity of two 
individual units of a polymer of linked units. The method includes the steps of moving the polymer 
35 relative to a station, exposing individual units to the station to produce a characteristic signal arising 
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from a detectable physical change in the unit or the station, detecting characteristic signals generated, 
and measuring the amount of time elapsed between detecting characteristic signals, the amount of 
time elapsed being indicative of the proximity of the two individual units. 

In one embodiment the station is an interaction station. In another embodiment the 
5 interaction station includes an agent and the agent is selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source and the 
characteristic signal is a detectable electromagnetic radiation signal. In another embodiment the 
interaction station is a nanochannel in a wall material. 

In certain other embodiments the station referred to is a signal generation station. In another 

10 embodiment the signal generation station includes a physical impulse source which interacts with 
the polymer to produce a characteristic signal which is a physical impulse. The physical impulse 
in one embodiment arises from a change in a physical quantity such as resistance or conductance as 
a result of the exposure of the physical impulse source to the unit of the polymer. In one 
embodiment the physical impulse arises from changes in capacitance or resistance caused by the 

! 5 movement of the unit between microelectrodes or nanoelectrodes positioned adjacent to the polymer 
unit. For instance the signal generation station may include microelectrodes or nanoelectrodes 
positioned on opposite sides of the polymer unit. The changes in resistance or conductance which 
occur as a result of the movement of the unit past the electrodes will be specific for the particular 
unit. In another embodiment the physical impulse arises from a release of radioactive signal from 

20 the unit. In other embodiments it arises from piezoelectric tip, direct physical contact, and NMR- 
nuclear spin signal. 

The polymer may be any type of polymer known in the art. In a preferred embodiment the 
polymer is selected from the group consisting of a nucleic acid and a protein. In a more preferred 
embodiment the polymer is a nucleic acid. The polymers can be labeled, randomly or non randomly. 
25 Different labels can be used to label different linked units to produce different signals. In one 
embodiment the individual units of the polymer are labeled with a fluorophore. 

A method for determining the order of two individual units of a polymer of linked units is 
provided in another aspect of the invention. The method involves the steps of moving the polymer 
linearly with respect to a station, exposing one of the individual units to the station to produce a 
30 signal arising from a detectable physical change in the unit or the station, exposing the other of the 
individual units to the station to produce a second detectable signal arising from a detectable 
physical change in the unit or the station, different from the first signal, and determining the order 
of the signals as an indication of the order of the two individual units. 

In one embodiment the station is an interaction station. In another embodiment the 
35 interaction station includes an agent and the agent is selected from the group consisting of 
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electromagnetic radiation, a quenching source and a fluorescence excitation source and the 
characteristic signals produced are detectable electromagnetic radiation signals. In another 
embodiment the interaction station is a nanochannel in a wall material. 

In certain other embodiments the station referred to is a signal generation station. In another 

5 embodiment the signal generation station includes a physical impulse source which interacts with 
the polymer to produce a characteristic signal which is a physical impulse. The physical impulse 
in one embodiment arises from a change in a physical quantity such as resistance or conductance as 
a result of the exposure of the physical impulse source to the unit of the polymer. In one 
embodiment the physical impulse arises from changes in capacitance or resistance caused by the 

10 movement of the unit between microelectrodes or nanoelcctrodes positioned adjacent to the polymer 
unit. For instance the signal generation station may include microelectrodes or nanoelectrodes 
positioned on opposite sides of the polymer unit. The changes in resistance or conductance which 
occur as a result of the movement of the unit past the electrodes will be specific for the particular 
unit. In another embodiment the physical impulse arises from a release of radioactive signal from 

15 the unit. In other embodiments it arises from piezoelectric tip, direct physical contact, and NMR- 
nuclear spin signal. 

The polymer may be any type of polymer known in the art. In a preferred embodiment the 
polymer is selected from the group consisting of a nucleic acid and a protein. In a more preferred 
embodiment the polymer is a nucleic acid. The polymers can be labeled, randomly or non randomly. 

20 Different labels can be used to label different linked units to produce different signals. In one 
embodiment the individual units of the polymer are labeled with a fluorophore. In another 
embodiment the individual units of the polymer are labeled with radioactivity. 

According to yet another aspect of the invention a method for determining the distance 
between two individual units of a polymer of linked units is provided. The method involves the 

25 steps of causing the polymer to pass linearly relative to a station, detecting a characteristic signal 
generated as each of the two individual units passes by the station, measuring the time elapsed 
between the signals measured, repeating steps 1, 2 and 3 for a plurality of similar polymers to 
produce a data set, and determining the distance between the two individual units based upon the 
information obtained from said plurality of similar polymers by analyzing the data set. 

30 In one embodiment the station is an interaction station. In another embodiment the 

interaction station includes an agent and the agent is selected from the group consisting of 
electromagnetic- radiation, a quenching source and a fluorescence excitation source. In another 
embodiment the characteristic signals produced are detectable electromagnetic radiation signals. In 
another embodiment the interaction station is a nanochannel in a wall material. 

35 In certain other embodiments the station referred to is a signal generation station: In another 
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embodiment the signal generation station includes a physical impulse source which interacts with 
the polymer to produce a characteristic signal which is a physical impulse. The physical impulse 
in one embodiment arises from a change in a physical quantity such as resistance or conductance as 
a result of the exposure of the physical impulse source to the unit of the polymer. In one 
5 embodiment the physical impulse arises from changes in capacitance or resistance caused by the 
movement of the unit between microelectrodes or nanoelectrodes positioned adjacent to the polymer 
unit. For instance the signal generation station may include microelectrodes or nanoelectrodes 
positioned on opposite sides of the polymer unit. The changes in resistance or conductance which 
occur as a result of the movement of the unit past the electrodes will be specific for the particular 

10 unit. In another embodiment the two linked units are detected at the signal generation station by 
measuring light emission at the station. In another embodiment the physical impulse arises from a 
release of radioactive signal from the unit. In other embodiments it arises from piezoelectric tip, 
direct physical contact, and NMR-nuclear spin signal. 

The polymer may be any type of polymer known in the art. In a preferred embodiment the 

15 polymer is selected from the group consisting of a nucleic acid and a protein. In a more preferred 
embodiment the polymer is a nucleic acid. The polymers can be labeled, randomly or non randomly. 
Different labels can be used to label different linked units to produce different signals. In one 
embodiment the individual units of the polymer are labeled with a fluorophore. 

According to another embodiment the plurality of similar polymers is a homogeneous 

20 population. In another embodiment the plurality of similar polymers is a heterogenous population. 
In another embodiment steps (1) - (4) are carried out substantially simultaneously. 
According to yet another aspect of the invention a method for detecting resonance energy 
transfer or quenching between two interactive partners capable of such transfer or quenching is 
disclosed. The method involves the steps of bringing the two partners in close enough proximity 

25 to permit such transfer or quenching, applying an agent to one of said partners, the agent selected 
from the group consisting of electromagnetic radiation, a quenching source and a fluorescence 
excitation source, shielding fluorescence resonance energy transfer and quenching occurring from 
electromagnetic radiation emission and interaction between said partners with a material shield, and 
detecting the emitted electromagnetic radiation. In a preferred embodiment the material shield is 

30 a conductive material shield. 

In another aspect the invention is a method for analyzing a polymer of linked units. The 
method includes the steps of providing a labeled polymer of linked units, detecting signals from unit 
specific markers of less than all of the linked units, and storing a signature of said signals detected 
to analyze the polymer. In one embodiment all of the unit specific markers are detected. In another 

35 embodiment the polymer is partially and randomly labeled with unit specific markers. In yet another 
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embodiment only a portion of the unit specific markers are detected. All of the units of the polymer 
are labeled with a unit specific marker in another embodiment. 

The labeled polymer of linked units in one embodiment is exposed to an agent selected from 
the group consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
5 source and the signals are produced by the interaction between a unit specific marker of the polymer 
and the agent. 

In one embodiment the signals are detected linearly. In another embodiment the signature 
of signals includes at least 10 signals, and preferably 20 signals. The signature of signals includes 
any information about the polymer. Preferably the signature of signals includes information about 

10 the order, distance and number of unit specific markers. 

In another embodiment the labeled polymer of linked units is moved with respect to a station 
and wherein the signals are generated upon exposure of a unit specific marker of the polymer to the 
station. The station may be an interaction station. 

The method in some embodiments is a method for identifying a unit specific marker of the 

15 polymer, the identity of the unit specific marker being indicative of the identity of at least one unit 
of the polymer. The unit specific marker is transiently exposed to a station to produce signals 
characteristic of said unit specific marker and the signal is distinguished from signals generated from 
adjacent signal generating unit specific markers of the polymer as an indication of the identity of the 
unit specific marker. The station may be an interaction station including an agent selected from the 

20 group consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
source and wherein the signals are detectable electromagnetic radiation signals. 

The method in other embodiments is a method for determining the proximity of two unit 
specific markers of the polymer wherein the proximity of the two unit specific markers is the 
signature of said signals, the identity of each unit specific marker being indicative of the identity of 

25 at least one unit of the polymer. The labeled polymer is moved relative to a station to expose the two 
unit specific markers to the station to produce a characteristic signal arising from a detectable 
physical change in the unit specific marker or the station, and the amount of time elapsed between 
detecting each characteristic signal is measured, the amount of time elapsed being indicative of the 
proximity of the two unit specific markers. 

30 The method may also be a method for determining the order of two unit specific markers of 

the polymer, the identity of each unit specific marker being indicative of the identity of at least one 
unit of the polymer wherein the order of the two unit specific markers is the signature of said signals. 
The labeled polymer is moved linearly with respect to a station, to expose one of the unit specific 
markers to the station to produce a signal which is a unit specific marker and to expose the other of 

35 the unit specific markers to the station to produce a second detectable which is a unit specific 
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marker, different from the first signal. The order of the signals determined is an indication of the 
order of the two unit specific markers. 

The method in an embodiment is a method for determining the distance between two unit 
specific markers of the polymer, the identity of each unit specific marker being indicative of the 

5 identity of at least one unit of the polymer wherein the distance between two unit specific markers 
is the signature of said signals. The labeled polymer is moved linearly relative to a station to 
produce a characteristic signal generated as each of the two unit specific markers passes by the 
station and the distance between the signals is determined as an indication of the distance between 
the two unit specific markers. 

10 The method is a method for characterizing a test labeled polymer, wherein a plurality of 

labeled polymers is exposed to a station to obtain the signature of signals for each of the plurality 
of labeled polymers in another embodiment. The method involves the steps of comparing the 
signature of signals of the plurality of polymers, determining the relatedness of the polymers based 
upon similarities between the signature of signals of the polymers, and characterizing the test 

15 polymer based upon the signature of signals of related polymers. 

According to yet another embodiment the method is a method for sequencing a polymer of 
linked units. A signature of signals is obtained from each of a plurality of overlapping polymers, 
at least a portion of each of the polymers having a sequence of linked units identical to the other of 
the polymers, and the signature of signals is compared to obtain a sequence of linked units which 

20 is identical in the plurality of polymers. 

The method in another embodiment is a method for analyzing a set of polymers, each 
polymer of said set being an individual polymer of linked units and wherein the set of polymers is 
oriented parallel to one another and a polymer specific feature of said polymers is detected: 

Each of the above methods is based on an interaction between a polymer and a station 

25 involving in some embodiments energy transfer or quenching between a unit and an agent which 
results in the generation of a signal and in other embodiments a physical change in the unit or station 
which results in the generation of a signal. Each of the methods can be performed on many polymers 
simultaneously or on as few as one polymer at a time. 

Methods for analyzing multiple polymers at one time based on an interaction involving 

30 polymer dependent impulses between the unit and the station also can be performed. These 
methods, which are set forth below, are based on an interaction between a unit and a signal 
generation station which produces any type of polymer dependent impulse which can be detected. 
The polymer dependent impulse is generated by exposure of a unit of the polymer to a signal 
generation station but does not require that a physical change in the polymer unit or the station occur. 

35 For instance, the polymer dependent impulse may result from energy transfer, quenching, changes 



WO 98/35012 PCT/US98/03024 

- 17- 

in conductance, mechanical changes, resistance changes, or atiy other physical change. 

A method for characterizing a test polymer is another aspect of the invention. A method for 
characterizing a test polymer is carried out by obtaining polymer dependent impulses for each of a 
plurality of polymers, comparing the polymer dependent impulses of the plurality of polymers, 
5 determining the relatedness of the polymers based upon similarities between the polymer dependent 
impulses of the polymers, and characterizing the test polymer based upon the polymer dependent 
impulses of related polymers. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. In 
one embodiment the plurality of polymers is a homogenous population. In another embodiment the 
10 plurality of polymers is a heterogenous population. The polymers can be labeled, randomly or non 
randomly. Different labels can be used to label different linked units to produce different polymer 
dependent impulses. 

The polymer dependent impulses provide many different types of structural information 
about the polymer. For instance the obtained polymer dependent impulses may include an order of 

15 polymer dependent impulses or the obtained polymer dependent impulses may include the time of 
separation between specific signals or the number of specific polymer dependent impulses. 

In one important embodiment the polymer dependent impulses are obtained by moving the 
plurality of polymers linearly past a signal generation station. 

According to another aspect the invention is a method for determining the distance between 

20 two individual units of a polymer of linked units. The method involves the steps of (1) causing the 
polymer to pass linearly relative to a station, (2) detecting a polymer dependent impulse generated 
as each of the two individual units passes by the signal generation station, (3) measuring the time 
elapsed between the polymer dependent impulses measured, (4) repeating steps 1, 2 and 3 for a 
plurality of similar polymers to produce a data set, and (5) determining the distance between the two 

25 individual units based upon the information obtained from said plurality of similar polymers by 
analyzing the data set. In one embodiment steps (1) - (4) are carried out substantially 
simultaneously. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. In 
one embodiment the plurality of polymers is a homogenous population. In another embodiment the 
30 plurality of polymers is a heterogenous population. The polymers can be labeled, randomly or non 
randomly. Different labels can be used to label different linked units to produce different polymer 
dependent impulses. 

In one embodiment the polymer dependent impulse measured is an electromagnetic radiation 
signal generated. In another embodiment the two linked units are detected at the signal generation 
35 station by measuring light emission at the station. The signal generation station can be a 
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nanochannel. 

According to another aspect the invention is a method for determining the order of two 
individual units of a polymer of linked units. The method involves the steps of (1) moving the 
polymer to linearly with respect to a signal generation station, (2) exposing one of the individual 

5 units to the station to produce a polymer dependent impulse, (3) exposing the other of the individual 
units to the station to produce a second polymer dependent impulse, (4) repeating steps 1 , 2 and 3 
for a plurality of similar polymers to produce a data set, and (5) determining the order of the two 
individual units based upon the information obtained from said plurality of similar polymers by 
analyzing the data set. In one embodiment steps (1) - (4) are carried out substantially 

10 simultaneously. In one embodiment the signal measured is an electromagnetic radiation signal. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. In 
one embodiment the plurality of polymers is a homogenous population. In another embodiment the 
plurality of polymers is a heterogenous population. The polymers can be labeled, randomly or non 
randomly. Different labels can be used to label different linked units to produce different polymer 

15 dependent impulses. 

In one embodiment the polymer dependent impulse measured is an electromagnetic radiation 
signal generated. In another embodiment the two linked units are detected at the signal generation 
station by measuring light emission at the station. The signal generation station can be a 
nanochannel. 

20 In another aspect of the invention a method for sequencing a polymer of linked units is 

provided. The method involves the steps of obtaining polymer dependent impulses from a plurality 
of overlapping polymers, at least a portion of the polymers having a sequence of linked units 
identical to the other of the polymers, and comparing the polymer dependent impulses to obtain a 
sequence of linked units which is identical in the plurality of polymers. 

25 In one embodiment the polymer dependent impulses are optically detectable. In another 

embodiment the nucleic acids are labeled with an agent selected from the group consisting of an 
electromagnetic radiation source, a quenching source, a fluorescence excitation source, and a 
radiation source. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. In 
30 one embodiment the plurality of polymers is a homogenous population. In another embodiment the 
plurality of polymers is a heterogenous population. The polymers can be labeled, randomly or non 
randomly. Different labels can be used to label different linked units to produce different polymer 
dependent impulses. 

A method for labeling nucleic acids is also provided. The method involves the step of 
35 contacting a dividing cell with a nucleotide analog, isolating from the cell nucleic acids that have 
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incorporated the nucleotide analog, and modifying the nucleic acid with incorporated nucleotide 
analog by labeling the incorporated nucleotide analog. In one embodiment the nucleotide analog 
is a brominated analog. 

The dividing cell may optionally be contacted with a nucleotide analog by growth arresting 
5 the cell in the cell division cycle, performing the contacting step, and allowing the cell to reenter the 
cell division cycle. The nucleic acids may then be isolated after the cells have reentered and 
completed the cell division cycle and before a second cell division cycle is completed. 

In another embodiment the incorporated nucleotide analog is labeled with an agent selected 
from the group consisting of an electromagnetic radiation source, a quenching source and a 
10 fluorescence excitation source. 

According to another aspect of the invention a method is provided for analyzing a set of 
polymers, each polymer of said set being an individual polymer of linked units. The method 
involves the step of orienting the set of polymers parallel to one another, and detecting a polymer 
specific feature of said polymers. In one embodiment the orientation step is in a solution free of gel. 
15 The polymers may be oriented using any method. A preferred method for orienting the polymers 
is to apply an electric field to the polymers. 

The plurality of polymers may be any type of polymer but preferably is a nucleic acid. In 
one embodiment the plurality of polymers is a homogenous population. In another embodiment the 
plurality of polymers is a heterogenous population. The polymers can be labeled, randomly or non 
20 random!) . Different labels can be used to label different linked units to produce different polymer 
dependent impulses. 

The polymer specific feature is information about a structural feature of a polymer. The 
polymer specific feature can be an order of linked unity in the polymers. 

In one embodiment the detecting step is performed simultaneously for said polymers. In 
25 another embodiment the detection step comprises measuring electromagnetic radiation signals. 
According to a preferred embodiment the detection step comprises causing the polymers to pass 
linearly relative to a plurality of signal generation stations, and detecting and distinguishing polymer 
dependent impulses generated as said polymers pass said signal generation stations. 

A method for analyzing a set of polymers, each polymer of the set being an individual 
30 polymer of linked units is provided in another aspect of the invention. The method involves the 
steps of orienting the set of polymers in an electric field, simultaneously moving the set of polymers 
through defined respective channels, and detecting a polymer specific feature as the polymers are 
moved through the channels. In one embodiment the orientation step is in a solution free of gel. 
Preferably the channels are nanochannels. 
35 The plurality of polymers may be any type of polymer but preferably is a nucleic acid. In 
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one embodiment the plurality of polymers is a homogenous population. In another embodiment the 
plurality of polymers is a heterogenous population. The polymers can be labeled, randomly or non 
randomly. Different labels can be used to label different linked units to produce different polymer 
dependent impulses. 

5 The polymer specific feature is information about a structural feature of a polymer. The 

polymer specific feature can be an order of linked unity in the polymers. 

In one embodiment the detecting step is performed simultaneously for said polymers. In 

another embodiment the detection step comprises measuring electromagnetic radiation signals. 

According to a preferred embodiment the detection step comprises causing the polymers to pass 
10 linearly relative to a plurality of signal generation stations, and detecting and distinguishing polymer 

dependent impulses generated as said polymers pass said signal generation stations. 

According to yet another aspect of the invention an article of manufacture is provided. The 

article of manufacture includes a wall material having a surface defining a channel, an agent wherein 

the agent is selected from the group consisting of an electromagnetic radiation source, a quenching 
15 source, a luminescent film layer and a fluorescence excitation source, attached to the wall material 

adjacent to the channel, wherein the agent is close enough to the channel and is present in an amount 

sufficient to detectably interact with a partner compound selected from the group consisting of a 

light emissive compound, a light accepting compound, radiative compound, and a quencher passing 

through the channel. Preferably the channel is a support for a polymer. 
20 The agent in one embodiment is an electromagnetic radiation source and the electromagnetic 

radiation source is a light emissive compound. In another embodiment the channel is selected from 

the group consisting of is a microchanncl and a nanochannel. 

According to another embodiment the surface of the wall material defining the channel is free 

of the light emissive compound. In another embodiment the light emissive compound is attached 
25 to an externa! surface of the wall material. In yet another embodiment the light emissive compound 

is attached to a linker which is attached to the external surface of the wall material. In still another 

embodiment the light emissive compound is embedded in the wall material or in a layer of or upon 

the wall material. The light emissive compound can be concentrated at a region of the external 

surface of the wall material that surrounds a portion of the channel in another embodiment. The light 
30 emissive compound may form a concentric ring in the wall material around a portion of the channel. 

A masking layer having openings which allow exposure of only localized areas of the light emissive 

compound may also be part of the article of manufacture. 

A second light emissive compound different from the first may be attached to the wall 
material adjacent to the channel, wherein the light emissive compound is close enough to the channel 
35 and is present in an amount effective to detectably interact with a partner light emissive compound 
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passing through the channel. 

The wall material may be made up of different layers. In one embodiment the external 
surface of the wall material adjacent to the light emissive compound is a conducting layer. In 
another embodiment the wall material comprises two layers, the conducting layer and a 

5 nonconducting layer. The wall material may also be composed of at least two layers, a first layer 
preventing signal generation and a second layer allowing signal generation. Alternatively the wall 
material adjacent to the light emissive compound is a light impermeable layer. In another 
embodiment the wall material comprises two layers, the light impermeable layer and a support light 
permeable layer. The wall material can be a second light impermeable layer on a second side of the 

10 light emissive compound, the first and second layers sandwiching the light emissive compound. In 
a preferred embodiment the light emissive compound is a fluorescent compound. 

The channel can have any shape or dimensions. Preferably the channel is a nanochannel 
which is between 1 Angstrom and 1 mm. In a preferred embodiment the width of the channel is 
between 1 and 500 Angstroms. Preferably the wall includes multiple channels. Preferably the wall 

15 material includes at least 2 and more preferably at least 50 channels. 

In one embodiment the wall material is formed of two layers, a first light impermeable layer 
and a luminescent film layer attached to one another, wherein the channel extends through both 
layers and is defined by surfaces of both layers. Preferably the. channel is a nanochannel. In a 
preferred embodiment the length of the channel is between 1 Angstrom and 1 mm. The article in 

20 some embodiments includes a second light impermeable layer, the luminescent film layer positioned 
between the first and second light impermeable layers. In a preferred embodiment the surface 
defining the channel includes a surface of the light impermeable layer which is free of luminescent 
film layer material. 

In another embodiment the agent is a fluorescence excitation source and wherein the 
25 fluorescence excitation source is a scintillation layer. The scintillation layer may be selected from 
the group consisting of Nal(TI), ZnS(Ag), anthracene, stilbene, and plastic phosphors. Preferably 
the scintillation layer is embedded in the wall material between two radiation impermeable layers, 
such as lead or Lucite. 

In another aspect the invention is an article of manufacture which is a wall material having 
30 a surface defining a plurality of channels, and a station attached to a discrete region of the wall 
material adjacent to at least one of the channels, wherein the station is close enough to the channel 
and is present in an amount sufficient to cause a signal to arise from a detectable physical change 
in a polymer of linked units passing through the channel or in the station as the polymer is exposed 
to the station. 

35 According to another aspect of the invention an article of manufacture is provided. The 
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article is a wall material having a surface defining a channel, and a plurality of stations each attached 
to a discrete region of the wall material adjacent to the channel, wherein the stations are close 
enough to the channel and are present in an amount sufficient to cause a signal to arise from a 
detectable physical change in a polymer of linked units passing through the channel or in the station 
as the polymer is exposed to the station. 

A method for preparing a wall material is another aspect of the invention. The method 
involves the steps of covalently bonding light emissive compounds or quenching compounds to a 
plurality of discrete locations of a wall material, each of said discrete locations close enough to a 
respective interaction station on said wall material, whereby when an individual unit of a polymer, 
which is interactive with said light emissive compound or quenching compound to produce a signal, 
is positioned at said interaction station, the light emissive compound or the quenching compound 
interacts with the individual unit to produce the signal. In one embodiment the method includes the 
step of applying a layer of conductive material to said wall material. 

In another embodiment the light emissive compounds or quenching compounds are 
covalently bonded at discrete locations close to channels in said wall material, said channels defining 
interaction stations. The channels preferably are microchannels. In a more preferred embodiment 
the channels are nanochannels. The light emissive compounds or quenching compounds can be 
covalently bonded to the wall material in a manner whereby the surfaces of the wall material 
defining the channel are free of the light emissive compounds and quenching compounds. 

The invention also encompasses a method for attaching a chemical substance selectively at 
a rim of a channel through a wall material that is opaque. The method involves the steps of 
providing a wall material with photoprotective chemical groups attached at the rim of the channel 
through the wall material, applying light to the photoprotective chemical groups to dephotoprotect 
the chemical groups, and attaching the chemical substance to the deprotected chemical groups. 

In one embodiment the light is applied to only selected regions of a surface of the wall 
material defining the rim of the channel. In another embodiment the channel has a first end and a 
second end, the rim being at the first end, and wherein the light is applied to the second end, the light 
passing through the channel to contact the photoprotected chemical groups at the rim of said first 
end. The channels preferably are microchannels. In a more preferred embodiment the channels are 
nanochannels. 

According to another aspect of the invention a method is provided for preparing a wall 
material having localized areas of light emission on a surface of the wall material. The method 
involves the steps of providing a wall material having a surface and applying a light emissive 
compound to the surface to produce at least localized areas of light emission on the surface, wherein 
the localized areas define a target region for detecting light emission, and wherein the target region 



WO 98/35012 PCT/US98/03024 

-23- 

is a rim of a channel through the wall material. In one embodiment the method further includes the 
steps of attaching a photoprotective chemical group to the surface of the wall material, applying light 
to the photoprotective chemical groups to dephotoprotect the chemical groups prior to attaching the 
light emissive compound, and attaching the light emissive compound to the dephotoprotected 
5 chemical groups. 

In one embodiment the light is applied to only selected regions of a surface of the wall 
material defining the rim of the channel. In a preferred embodiment the photoprotective chemical 
group is attached to only selected regions of the surface of the wall material defining the rim of the 
channel. In another embodiment the channel has a first end and a second end, the rim being at the 
10 first end, and wherein the light is applied to the second end, the light passing through the channel 
to contact the photoprotected chemical groups at the rim of said first end. The channels preferably 
are microchannels. In a more preferred embodiment the channels are nanochannels. 

The method can include the further step of positioning a mask having openings over the 
surface of the wall material such that only localized areas of light emission are exposed through the 
15 openings of the mask. In one embodiment the light emissive compound is attached to a portion of 
the surface of the wall material. 

According to another aspect of the invention an apparatus for detecting a signal is provided. 
The apparatus is a housing with a buffer chamber, a wall defining a portion of the buffer chamber, 
and having a plurality of openings for aligning polymers, a sensor fixed relative to the housing, the 
20 sensor distinguishing the signals emitted at each opening from the signals emitted at the other of the 
openings to generate opening dependent sensor signals, and a memory for collecting and storing said 
sensor signals. In a preferred embodiment the sensor is an optical sensor. 

In one embodiment the optical sensor senses electromagnetic radiation signals emitted at the 
plurality of openings. In another embodiment the apparatus includes a microprocessor. 
25 In one embodiment the openings are defined by channels in the wall. Preferably the openings 

are defined by microchannels in the wall. More preferably the openings are defined by nanochannels 
in the wall. In one embodiment the plurality of openings is at least two. In a preferred embodiment 
the plurality is at least 50. 

In one embodiment the apparatus includes a second buffer chamber separated from said first 
30 buffer chamber, by said wall, and wherein the buffer chambers are in fluid communications with one 
another via the openings. In another embodiment the apparatus includes a pair of electrodes secured 
to the housing, one of said pair positioned in the first buffer chamber and the other of the pair 
positioned in the second buffer chamber. 

According to another aspect of the invention an apparatus for detecting a signal is provided. 
35 The apparatus includes a housing defining a first buffer chamber and a second buffer chamber, a wall 
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supported by the housing and separating the first and second buffer chambers, a plurality of channels 
defined by the wall and providing fluid communications between the first and second buffer 
chambers, and a sensor for distinguishing and collecting channel dependent signals. Preferably the 
channel is a microchannel. More preferably the channel is a nanochannel. In one embodiment the 

5 plurality of channels is at least two. In a preferred embodiment the plurality is at least 50. 
Preferably the signal is an optical signal. 

In one embodiment the wall surrounding the channel includes an agent is selected from the 
group consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
source is attached to the wall. Preferably the agent is electromagnetic radiation and the 

10 electromagnetic radiation is a light emissive compound. In one embodiment the light emissive 
compound is concentrated at the channels in the wall. 

According to another embodiment the apparatus includes a means for moving biological 
entities through the channels. 

According to another aspect of the invention an apparatus including a housing with a buffer 

15 chamber, a wall material defining a portion of the buffer chamber, the wall including polymer 
interaction stations, and an optical sensor secured to the housing, the optical sensor constructed and 
arranged to detect electromagnetic radiation signals emitted at the interaction stations is provided. 

In another aspect the invention is a computer system for making characteristic information 
of a plurality of polymers available in response to a request. The system has a memory for storing, 

20 for the plurality of the polymers and in a manner accessible using a unique identifier for the polymer, 
records including information indicative of sequentially detected signals arising from a detectable 
physical change in the plurality of individual units of the polymer or a station to which the polymer 
is exposed and a processor for accessing the records stored in the memory for a selected one of the 
plurality of the polymers according to a unique identifier associated with the selected polymer. 

25 In one embodiment the signal results from an interaction of a plurality of individual units of 

the polymer exposed to an agent selected from the group consisting of electromagnetic radiation, a 
quenching source and a fluorescence excitation source. In another embodiment the computer system 
also includes a means for comparing the sequentially detected signals of the selected polymer to a 
known pattern of signals characteristic of a known polymer to determine relatedness of the selected 

30 polymer to the known polymer. 

Each of the limitations of the invention can' encompass various embodiments of the 
invention. It is, therefore, anticipated that each of the limitations of the invention involving any one 
element or combinations of elements can be included in each apparatus and each method. 

35 
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Brief Description of the Drawings 

Figure 1 shows a schematic of a random labeling method. 

Figure 2 is a graph of raw data demonstrating changes in energy emission patterns to 
determine distance information through the instantaneous rate method. The changes in energy 
5 patterns result from sequential detectable signals which when plotted produce a curve that from left 
to right shows two energy intensity decreases, followed by two energy intensity increases. The rate 
is 6.8 A/s and t, is the time between the entry of the first and second labels. 

Figure 3 shows a representative population of random A-labeled fragments for a 16-mer with 
the sequence S'ACGTACGTACGTACGT^ and also depicts how distance information is used to 
10 determine base separation between acceptor labeled nucleotides. 

Figure 4 is a schematic depicting various potential constructions of a nanochannel plate. 
Figure 5 is a schematic depicting a nanochannel plate for analyzing a radioactive polymer. 
Figure 6 is a schematic of a cross section of a nanochannel with a concentric ring of donor 
fluorophores and a graph showing donor emission and energy transfer on the corresponding 
15 nanochannel. 

Figure 7 shows a model of a nanochannel plate having multiple nanochannels with a layer 
of donor fluorophores within the plate. Figure 7B shows an enlarged view of a single nanochannel 
with a single acceptor positioned adjacent to a conducting layer. 

Figure 8A and B arc schematics demonstrating signal generation upon passage of an acceptor 
20 label through the nanochannel. Figure 8A shows an enlarged view of one nanochannel. Figure 8B 
graphically illustrates the shaip changes in donor intensity as an acceptor label moves from positions 
A toC. 

Figure 9 schematically and graphically demonstrates the passage of a two-base labeled strand 
of DNA through a nanochannel with the proper thickness and radii of donor fluorophores 
25 sandwiched between conducting material. 

Figure 10 is a volume graph visually demonstrating the change in donor emission in the 
presence of one and two acceptors. Figure 10A illustrates the amount of change as volumes. Figure 
1 OB shows the change for one acceptor. 

Figure 1 1 shows the results of an experiment demonstrating that DNA can pass through 
30 fabricated nanochannels. 

Figure 12 shows an apparatus constructed to hold a nanochannel (or microchannel) plate 
which is capable of generating an electric field. 

Figure 13 depicts a nanochannel apparatus which consists of two fused Pyrex cells that hold 
the nanochannel plate, an upper and lower buffer region, electrodes, an immersion objective and a 
35 voltage supply. 
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Figure 14 is a schematic diagram of in vitro base specific (IBS A) labeling 

Figure 15 shows the general scheme for deciphering DNA sequence from IBS A labeling. 



5 Detailed Description of the Invention 

- The invention encompasses methods of analyzing or identifying a polymer or a unit of a 
polymer, by detecting a signal or polymer dependent impulse that results from an interaction 
between at least one unit of the polymer and a station or an agent or by a change in the unit or a 
station when the unit is exposed to the station. By "analyzing" a polymer, it is meant obtaining some 

10 information about the structure of the polymer such as its size, the order of its units, its relatedness 
to other polymers, the identity of its units, or its presence. Since the structure and function of 
biological molecules are interdependent, the structural information can reveal important information 
about the function of the polymer. 

One method according to the invention is a method for analyzing a polymer of linked units 

15 by exposing a plurality of units of the polymer to an agent selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source such that each 
of the individual units interacts with the agent to produce a detectable signal. The signal resulting 
from the interaction is detected sequentially. 

As used herein a unit of a polymer is "exposed" to an agent or a station by positioning or 

20 presenting the unit and the agent or station in interactive proximity to one another such that energy 
transfer or quenching or a physical change in the unit or agent or station can occur between them to 
produce a detectable signal. By interactive proximity it is meant close enough to permit the 
interaction or change which yields the detectable signal. 

In one embodiment the units of the polymer are exposed sequentially to the agent. By 

25 "sequentially exposed" it is meant in general separated from one another in time. In a preferred 
embodiment, the polymer and the agent are caused to move relative to one another in a "linear" 
manner such that each unit of the polymer passes within interactive proximity to the agent at an 
interaction station. When each unit of the polymer interacts with the agent or station, a detectable 
signal is produced. This detectable signal can be captured (sequentially detected) and recorded by 

30 a detection device. The detectable signal produced for each unit can be indicative of the type of unit. 
As used herein signals are detected "sequentially" when signals from different units of a single 
polymer are detected spaced apart in time. Not all units need to be detected or need to generate a 
signal to detect signals "sequentially." 

When the units are sequentially exposed to the agent or station the unit and the agent or 

35 station move relative to one another. As used herein the phrase "the unit and the agent or station 
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move relative to one another" means that either the unit and the agent or station are both moving or; ' _ . 
only one of the two is moving and the other remains stationary at least during the period of time of 
the interaction betweenlhe unit and the agent or station. The unit and the agent or station may be 
moved relative to one another by any mechanism. For instance the agent or station may remain 

5 stationary and the polymer may be drawn past the agent or station by an electric current. Other 
methods for moving the polymer include but are not limited to movement resulting from a magnetic 
field, a mechanical force, a flowing liquid medium, a pressure system, a gravitational force, and a 
molecular motor such as e.g., a DNA polymerase or a helicase when the polymer is DNA or e.g., 
myosin when the polymer is a peptide such as actin. The movement of the polymer may be assisted 

10 by the use of a channel, groove or ring to guide the polymer. Alternatively the agent or station may 
be moved and the polymer may remain stationary. For instance the agent or station may be held 
within a scanning tip that is guided along the length of the polymer. 

In another embodiment signals are detected simultaneously. As used herein signals are 
"detected simultaneously" by causing a plurality of the labeled units of a polymer to be exposed to 

15 an agent or station at once. The plurality of the units can be exposed to an agent or station at one 
time by using multiple interaction sites. Signals can be detected at each of these sites simultaneously. 
For instance multiple agents or stations may be localized at specific locations in space which 
correspond to the units of the polymer. When the polymer is brought within interactive proximity 
of the multiple agents or stations signals will be generated simultaneously. This may be embodied, 

20 for example, in a linear array of agents or stations positioned at substantially equivalent distances 
which are equal to the distance between the units. The polymer may be positioned with respect to 
the agent or station such that each unit is in interactive proximity to an agent or station to produce 
simultaneous signals. 

When the signals are detected sequentially multiple polymers also can be analyzed 
25 simultaneously. Multiple polymers are analyzed simultaneously by causing more than one polymer 
to move relative to respective agent or stations at one time. The polymers may be similar or distinct. 
If the polymers are similar, the same or different units may be detected simultaneously. It is 
preferred that at least two polymers are analyzed simultaneously. In a more preferred embodiment 
at least 50 polymers are analyzed simultaneously and in another preferred embodiment at least 100 
30 polymers are analyzed simultaneously. 

A "polymer" as used herein is a compound having a linear backbone of individual units 
which are linked together by linkages. In some cases the backbone of the polymer may be branched. 
Preferably the backbone is unbranched. The term "backbone" is given its usual meaning in the field 
of polymer chemistry. The polymers may be heterogeneous in backbone composition thereby 
35 containing any possible combination of polymer units linked together such as peptide- nucleic acids 
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(which have amino acids linked to nucleic acids and have enhanced stability). In a preferred 
embodiment the polymers are homogeneous in backbone composition and are, for example, nucleic 
acids, polypeptides, polysaccharides, carbohydrates, polyurethanes, polycarbonates, polyureas, 
polyethyleneimines, polyarylene sulfides, polysiloxanes, polyimides, polyacetates, polyamides, 

5 polyesters, or polythioesters. In the most preferred embodiments, the polymer is a nucleic acid or 
a polypeptide. A "nucleic acid"as used herein is a biopolymer comprised of nucleotides, such as 
deoxyribose nucleic acid (DNA) or ribose nucleic acid (RNA). A polypeptide as used herein is a 
biopolymer comprised of linked amino acids. 

As used herein with respect to linked units of a polymer, "linked" or "linkage" means two 

10 entities are bound to one another by any physicochemical means. Any linkage known to those of 
ordinary skill in the art, covalent or non-covalent, is embraced. Such linkages are well known to 
those of ordinary skill in the art. Natural linkages, which are those ordinarily found in nature 
connecting the individual units of a particular polymer, are most common. Natural linkages include, 
for instance, amide, ester and thioester linkages. The individual units of a polymer analyzed by the 

15 methods of the invention may be linked, however, by synthetic or modified linkages. Polymers 
where the units are linked by covalent bonds will be most common but also include hydrogen 
bonded, etc.. 

The polymer is made up of a plurality of individual units. An "individual unit" as used 
herein is a building block or monomer which can be linked directly or indirectly to other building 
20 blocks or monomers to form a polymer. The polymer preferably is a polymer of at least two 
different linked units. The at least two different linked units may produce or be labeled to produce 
different signals, as discussed in greater detail below. The particular type of unit will depend on the 
type of polymer. For instance DNA is a biopolymer comprised of a deoxyribose phosphate 
backbone composed of units of purines and pyrimidines such as adenine, cytosine, guanine, thymine, 

25 5-methylcytosine, 2-aminopurine, 2-amino-6-chloropurine, 2,6-diaminopurine, hypoxanthine, and 
other naturally and non-naturally occurring nucleobases, substituted and unsubstituted aromatic 
moieties. RNA is a biopolymer comprised of a ribose phosphate backbone composed of units of 
purines and pyrimidines such as those described for DNA but wherein uracil is substituted for 
thymidine. DNA units may be linked to the other units of the polymer by their 5' or 3' hydroxyl 

30 group thereby forming an ester linkage. RNA units may be linked to the other units of the polymer 
by their 5', 3* or 2' hydroxyl group thereby forming an ester linkage. Alternatively, DNA or RNA 
units having a terminal 5', 3' or 2' amino group may be linked to the other units of the polymer by 
the amino group thereby forming an amide linkage. The individual units of a polypeptide are amino 
acids, including the 20 naturally occurring amino acids as well as modified amino acids. Amino 

35 acids may exist as amides or free acids and are linked to the other units in the backbone of the 



RNpnOOin- <WO oo^prn?*? I 



« » 



WO 98/35012 PCT/US98/03024 

-29- 

polymers through their a-amino group thereby forming an amide linkage to the polymer. 
A "plurality of individual units" is at least two units linked to one another. 
Whenever a nucleic acid is represented by a sequence of letters it will be understood that the 
nucleotides are in 5'-» 3' order from left to right and that "A" denotes adenosine, "C" denotes 
5 cytidine, "G" denotes guanosine, "T" denotes thymidine, and "IT denotes uracil unless otherwise 
noted. 

The polymers may be native or naturally-occurring polymers which occur in nature or non- 
naturally occurring polymers which do not exist in nature. The polymers typically include at least 
a portion of a naturally occurring polymer. The polymers can be isolated or synthesized de novo. 
10 For example, the polymers can be isolated from natural sources e.g. purified, as by cleavage and gel 
separation or may be synthesized e.g.,(i) amplified in vitro by, for example, polymerase chain 
reaction (PCR); (ii) synthesized by, for example, chemical synthesis; (iii) recombinantly produced 
by cloning, etc. 

The polymer or at least one unit thereof is in a form which is capable of interacting with an 
15 agent or station to produce a signal characteristic of that interaction. The unit of a polymer which 
is capable of undergoing such an interaction is said to be labeled. If a unit of a polymer can undergo 
that interaction to produce a characteristic signal, then the polymer is said to be intrinsically labeled. 
It is not necessary that an extrinsic label be added to the polymer. If a non-native molecule, 
however, must be attached to the individual unit of the polymer to generate the interaction producing 
20 the characteristic signal, then the polymer is said to be extrinsically labeled. The "label" may be, 
for example, light emitting, energy accepting, fluorescent, radioactive, or quenching. 

Many naturally occurring units of a polymer are light emitting compounds or quenchers. For 
instance, nucleotides of native nucleic acid molecules have distinct absorption spectra, e.g., A, G ? 
T, C, and U have absorption maximums at 259 nm, 252 nm, 267 nm, 271 nm, and 258 nm 
25 respectively. Modified units which include intrinsic labels may also be incorporated into polymers. 
A nucleic acid molecule may include, for example, any of the following modified nucleotide units 
which have the characteristic energy emission patterns of a light emitting compound or a quenching 
compound: 2,4-dithiouraciI, 2,4-Diselenouracil, hypoxanthine, mercaptopurine, 2-aminopurine, and 
selenopurine. 

30 A unit may also be considered to be intrinsically labeled when a property of the unit other 

than a light emitting, quenching or radioactive property provides information about the identity of 
the unit without the addition of an extrinsic label. For instance the shape and charge of the unit 
provides information about the unit which can result in a specific characteristic signal, such as a 
change in conductance arising from the blockage of a conductance path by the unit. 

35 If an extrinsic label is selected for use according to the methods of the invention, the type of 
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extrinsic label selected will depend on a variety of factors, including the nature of the analysis being 
conducted, the type of the agent and the type of polymer. Extrinsic label compounds include but are 
not limited to light emitting compounds, quenching compounds, radioactive compounds, spin labels, 
and heavy metal compounds. The label should be stearically compatible and chemically compatible 
5 with the units of the polymer being analyzed. 

A "light emissive compound" as used herein is a compound that emits light in response to 
irradiation with light of a particular wavelength. These compounds are capable of absorbing and 
emitting light through phosphorescence, chemiluminescence, luminescence, polarized fluorescence, 
scintillators or, more preferably, fluorescence. The particular light emissive compound selected will 
10 depend on a variety of factors which are discussed in greater detail below. Light emissive 
compounds have been described extensively in the literature. For example, Haugland, R. P., 
Handbook of Fluorescent Probes and Research Chemicals, 6th edition, Molecular Probes, Inc., 
1996, which is hereby incorporated by reference provides a description of light emitting compounds. 
Radioactive compounds are substances which emit alpha, beta, or gamma nuclear radiation. 
] 5 Alpha rays are positively charged particles of mass number 4 and slightly deflected by electrical and 
magnetic fields. Beta rays are negatively charged electrons and are strongly deflected by electrical 
and magnetic fields. Gamma rays are photons of electromagnetic radiation and are undeflccted by 
electrical and magnetic fields and are of wavelength of the order to 10" 8 to 10" 9 cm. 

Generally, fluorescent dyes are hydrocarbon molecules having a chain of several conjugated 
20 double bonds. The absorption and emission wavelengths of a dye are approximately proportional 
to the number of carbon atoms in the conjugated chain. A preferred fluorescent compound is "Cy-3" 
(Biological Detection Systems, Pittsburgh, PA). Other preferred fluorescent compounds useful 
according to the invention include but are not limited to fluorescein isothiocyanate ("FITC")* Texas 
red, tetramethylrhodamine isothiocyanate ("TRITC"), 4, 4-difluoro-4-bora-3a, and 4a-diaza-s- 
25 indacene ("BODIPY"). 

Chemiluminescent compounds are compounds which luminesce due to a chemical reaction. 
Phosphorescent compounds are compounds which exhibit delayed luminescence as a result of the 
absorption of radiation. Luminescence is a non-thermal emission of electromagnetic radiation by 
a material upon excitation. These compounds are well known in the art and are available from a 
30 variety of sources. 

In one embodiment of the invention the light emissive compound is a donor or an acceptor 
fluorophore. A fluorophore as used herein is a molecule capable of absorbing light at one 
wavelength and emitting light at another wavelength. A donor fluorophore is a fluorophore which 
is capable of transferring its fluorescent energy to an acceptor molecule in close proximity. An 
35 acceptor fluorophore is a fluorophore that can accept energy from a donor at close proximity. (An 
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acceptor of a donor fluorophore does not have to be a fluorophore. It may be non-fluorescent.) 
Fluorophores can be photochemical ly promoted to an excited state, or higher energy level, by 
irradiating them with light. Excitation wavelengths are generally in the UV, blue, or green regions 
of the spectrum. The fluorophores remain in the excited state for a very short period of time before 
5 releasing their energy and returning to the ground state. Those fluorophores that dissipate their 
energy as emitted light are donor fluorophores. The wavelength distribution of the outgoing photons 
forms the emission spectrum, which peaks at longer wavelengths (lower energies) than the excitation 
spectrum, but is equally characteristic for a particular fluorophore. 

In another embodiment of the invention the unit is labeled with a radioactive compound. The 

m radioactive compound emits nuclear radiation as it passes the agent or station. When the agent is 
a scintillation layer the nuclear radiation interacts with the scintillation layer and causes fluorescent 
excitation. A fluorescent signal indicative of the radioactively labeled unit can then be detected. 

Extrinsic labels can be added to the polymer by any means known in the art. For example, 
the labels may be attached directly to the polymer or attached to a linker which is attached to the 

15 polymer. For instance, fluorophores have been directly incorporated into nucleic acids by chemical 
means but have also been introduced into nucleic acids through active amino or thio groups into a 
nucleic acid. Proudnikov and Mirabekov, Nucleic Acids Research, 24: 4535-4532 (1 996). Modified 
units which can easily be chemically derivitized or which include linkers can be incorporated into 
the polymer to enhance this process. An extensive description of modification procedures which can 

20 be performed on the polymer, the linker and/or the extrinsic label in order to prepare a bioconjugatc 
can be found in Hermanson, G. T., Bioconjugate Techniques, Academic Press. Inc.. San Diego, 
iwci. which is hereby incorporated by reference. 

There are several known methods of direct chemical labeling of DNA (Hermanson, 1996; 
Rogel el aU 1989; Proudnikov and Mirzzbekov, 1996). One of the methods is based on the 

25 introduction of aldehyde groups by partial depurination of DNA. Fluorescent labels with an attached 
hydrazine group are efficiently coupled with the aldehyde groups and the hydrazonc bonds are 
stabilized by reduction with sodium labeling efficiencies around 60%. The reaction of cytosine with 
bisulfite in the presence of an excess of an amine fluorophore leads to transamination at the N 4 
position (Hermanson, 1996). Reaction conditions such as pH, amine fluorophore concentration, and 

30 incubation time and temperature affect the yield of products formed. At high concentrations of the 
amine fluorophore (3M), transamination can approach 100% yield (Draper and Gold. 1980). 

Light emissive compounds can be attached to polymers or other materials by any mechanism 
known in the art. For instance, functional groups which are reactive with various light emissive 
groups include, but are not limited to, (functional group: reactive group of light emissive compound) 

35 activated cstenamines or anilines; acyl azide:amines or anilines; acyl halide:amines, anilines, 
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alcohols or phenols; acyl nitriieialcohols or phenols; aldehyde:amines or anilines; alkyl 
halide:amines, anilines, alcohols, phenols or thiols; alkyl sulfonaterthiols, alcohols or phenols; 
anhydride:alcohols, phenols, amines or anilines; aryl halide:thiols; aziridine:thiols or thioethers; 
carboxylic acid:amines, anilines, alcohols or alkyl halides; diazoalkanercarboxylic acids; 
5 epoxide:thiols; haloacetamide:thiols; halotriazineramines, anilines or phenols; hydrazine:aldehydes 
or ketones; hydroxyamine:aldehydes or ketones; imido estenamines or anilines; isocyanateramines 
or anilines; and isothiocyanateiamines or anilines. 

The agent that interacts with the unit of the polymer is selected from the group consisting of 
electromagnetic radiation, a quenching source, and a fluorescence excitation source. 
10 ''Electromagnetic radiation" as used herein is energy produced by electromagnetic waves. 
Electromagnetic radiation may be in the form of a direct light source or it may be emitted by a light 
emissive compound such as a donor fluorophorc. "Light" as used herein includes electromagnetic 
energy of any wavelength including visible, infrared and ultraviolet. 

As used herein, a quenching source is any entity which alters or is capable of altering a 
15 property of a light emitting source. The property which is altered can include intensity fluorescence 
lifetime, spectra, fluorescence, or phosphorescence. 

A fluorescence excitation source as used herein is any entity capable of fluorescing or giving 
rise to photonic emissions (i.e. electromagnetic radiation, directed electric field, temperature, 
fluorescence, radiation, scintillation, physical contact, or mechanical disruption.) For instance, when 
20 the unit is labeled with a radioactive compound the radioactive emission causes molecular excitation 
of an agent that is a scintillation layer which results in fluorescence. 

When a unit of the polymer is exposed to the agent the interaction between the two produces 
a signal. The signal provides information about the polymer. For instance if all units of a particular 
type, e.g., all of the alanines, of a protein polymer are labeled (intrinsic or extrinsic) with a particular 
25 light emissive compound then when a signal characteristic of that light emissive compound is 
detected upon interaction with the agent the signal signifies that an alanine residue is present at that 
particular location on the polymer. If each type of unit e.g., each type of amino acid is labeled with 
a different light emissive compound having a distinct light emissive pattern then each amino acid 
will interact with the agent to produce a distinct signal. By determining what each signal for each 
30 unit of the polymer is, the sequence of units can be determined. 

The interaction between the unit and the agent can take a variety of forms, but does not 
require that the unit and the agent physically contact one another. Examples of interactions are as 
follows. A first type of interaction involves the agent being electromagnetic radiation and the unit 
of the polymer being a light emissive compound (either intrinsically or extrinsically labeled with a 
35 light emissive compound). When the light emissive unit is contacted with electromagnetic radiation 
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(such as by a laser beam of a suitable wavelength or electromagnetic radiation emitted from a donor 
fluorophore), the electromagnetic radiation causes the light emissive compound to emit 
electromagnetic radiation of a specific wavelength. The signal is then measured. The signal exhibits 
a characteristic pattern of light emission and thus indicates that a particular labeled unit of the 
5 polymer is present. In this case the unit of the polymer is said to "detectably affect the emission of 
the electromagnetic radiation from the light emissive compound." 

A second type of interaction involves the agent being a fluorescence excitation source and 
the unit of the polymer being a light emissive or a radioactive compound. When the light emissive 
unit is contacted with the fluorescence excitation source, the fluorescence excitation source causes 

10 the light emissive compound to emit electromagnetic radiation of a specific wavelength. When the 
radioactive unit is contacted with the fluorescence excitation source, the nuclear radiation emitted 
from the unit causes the fluorescence excitation source to emit electromagnetic radiation of a specific 
wavelength. The signal then is measured. 

A variation of these types of interaction involves the presence of a third element of the 

15 interaction, a proximate compound which is involved in generating the signal. For example, a unit 
may be labeled with a light emissive compound which is a donor fluorophore and a proximate 
compound can be an acceptor fluorophore. If the light emissive compound is placed in an excited 
state and brought proximate to the acceptor fluorophore, then energy transfer will occur between the 
donor and acceptor, generating a signal which can be detected as a measure of the presence of the 

20 unit which is light emissive. The light emissive compound can be placed in the ''excited" state by 
exposing it to light (such as a laser beam) or by exposing it to a fluorescence excitation source. 

Another interaction involves a proximate compound which is a quenching source. In this 
instance, the light emissive unit is caused to emit electromagnetic radiation by exposing it to light. 
If the light emissive compound is placed in proximity to a quenching source, then the signal from 

25 the light emissive unit will be altered. 

A set of interactions parallel to those described above can be created wherein, however, the 
light emissive compound is the proximate compound and the unit is either a quenching source or an 
acceptor source. In these instances the agent is electromagnetic radiation emitted by the proximate 
compound, and the signal is generated, characteristic of the interaction between the unit and such 

30 radiation, by bringing the unit in interactive proximity with the proximate compound. 

The mechanisms by which each of these interactions produces a detectable signal is known 
in the art. For exemplary purposes the mechanism by which a donor and acceptor fluorophore 
interact according to the invention to produce a detectable signal including practical limitations 
which are known to result from this type of interaction and methods of reducing or eliminating such 

35 limitations is set forth below. 
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In a preferred embodiment the signal generated by the interaction between the unit and the 
agent results from fluorescence resonance energy transfer (FRET) between fluorophores. Either the 
unit or the proximate compound/agent may be labeled with either the donor or acceptor fluorophore. 
FRET is the transfer of photonic energy between fluorophores. FRET has promise as a tool in 
5 characterizing molecular detail because of its ability to measure distances between two points 
separated by 10 A to 100 A. The angstrom resolution of FRET has been used in many studies of 
molecular dynamics and biophysical phenomena (for reviews see Clegg, 1995; Clegg, 1992; Selvin, 
1995; and Wu and Brand, 1994). The resolving power of FRET arises because energy transfer 
between donor and acceptor fluorophores is dependent on the inverse sixth power of the distance 
10 between the probes. In practice, this resolution is about an order of magnitude better than that of the 
highest resolution electron microscope. 

In order to undergo FRET, the emission spectrum of the donor overlaps with the excitation 
spectrum of the acceptor. The unit of the polymer is specifically labeled with an acceptor 
fluorophore. The agent is a donor fluorophore. A laser is tuned to the excitation wavelength of the 
1 5 donor fluorophore. As the polymer is moved through the channel, the donor fluorophore emits its 
characteristic wavelength. As the acceptor fluorophore moves into interactive proximity with the 
donor fluorophore, the acceptor fluorophore is excited by the energy from the donor fluorophore. 
The consequence of this interaction is that the emission of the donor fluorophore is quenched and 
that of the acceptor fluorophore is enhanced. 
20 In order to generate an optimal efficient FRET signal for detection, two conditions should 

be satisfied. The first condition is efficient donor emission in the absence of acceptors. The second 
is efficient generation of a change in either donor or acceptor emissions during FRET. 

In a model system, both the donor fluorophore and the acceptor fluorophore can give rise to 
photonic emissions indefinitely. In such circumstances, during energy transfer, the decrease in the 
25 donor emission is equal to the increase in the acceptor emission. In other words, the emission 
change is identical for both the donor and the acceptor. This is so because for every one donor 
emission quenching event, there is an equal and opposite acceptor emission event. Cantor and 
SchtmmcL 1 ( >SO present an intuitive outline of this model system. Consider a system of two 
fluorophores. a donor and an acceptor. At a range within Forster distance, very weak coupling 
30 occurs. The interaction between the donor and the acceptor is summarized as: 

k t 

D b + A a ** D a + A b 
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where D a and A a are the ground singlet states and D b and A h are the first excited singlet states. A>and 
k. r are the forward and reverse rates of energy transfer. The above reaction states that for a given 
donor in the first excited state, it will rapidly lose energy by internal conversion until it reaches its 
ground state, D ir Since the acceptor absorption energies overlap with the donor emission energies, 
5 very weak coupling occurs, allowing resonance energy transfer to take place. 

As a result of the resonance energy transfer, both the donor (D h ) and the acceptor (A h ) are in 
excited vibrational states. Vibrational relaxation rapidly brings these to their respective ground 
vibrational levels. Vibrational relaxation is very efficient, with the average lifetime of an excited 
vibrational state being only about 10" 15 s (Skoog et al., 1992). Internal conversion, on the other hand, 
10 for fluorescent molecules is about 10" 9 s. The difference in these rates means that even when k r is 
very efficient, the reverse reaction (£_ 7 ) is unlikely to occur. In this ideal system, there is thus a shift 
of the relative population of excited donors and acceptors. 

The above analysis sets forth a system utilizing a single acceptor and a single donor. The 
same type of interaction may be performed on a system which utilizes many donors and one 
15 acceptor. Whenever the acceptor is in its ground singlet state (A J, energy transfer can occur. This 
means that for an acceptor with a lifetime of 1 ns, it can be excited 1 x 10 9 times per second, a very 
large rate compared to that of a donor fluorophore under standard illumination (-25,000 excitation 
cvents/s). Since the maximum number of excitations that an acceptor can undergo is much greater 
than that of one donor fluorophore, multiple donors can transfer their energy to one acceptor. 

20 Although acceptor excitation to higher energy levels (A x ) and simultaneous donor de- 

excitation events may be considered to be a potential problem resulting in the interference with 
signals generated in a system using multiple donors and a single acceptor the following analysis 
demonstrates that this is not the case. It is improbable that excitation to higher energy levels and 
simultaneous donor de-excitation events will interfere with signal generation. 

25 Acceptor excitation to higher levels means that there theoretically could be further excitation 

v from the acceptor's first singlet state (A h ) to the acceptor's higher singlet states (A^. If this were 
indeed possible, then the maximum number of excitation events in a given time increases because 
the effective lifetime is shorter. A shorter effective lifetime arises because the acceptor no longer 
has to de-excite to undergo energy transfer. Such transitions in the acceptor can only occur if the 

30 acceptor absoiption energies for a transition were coincident with the donor emission energies, 
which is generally not the case for most molecules. As a consequence, energy transfer between the 
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donors and the acceptor can only occur when the acceptor is in its ground singlet state. 

In cases of simultaneous donor de-excitation, only one of the donors can transfer energy to 
the acceptor. This means that the other donor is not quenched and emits radiatively. If simultaneous 
events occurred frequently, then the above scenario for multi-donor quenching would not be as 
5 accurate because a fraction of the donors would be able to emit photons. The following probability 
calculations demonstrate that such events are statistically infrequent, supporting the use of multi- 
donor quenching. 

probability of k simultaneous donor 
emissions occuring in one ns, where n = P nk = (P k )( ft Ck) 
10 is the number of fluorophores 

k-n 

number of donor emissions = (2-1)P n2 t = (3-1)P n3 t + ... +(n-1)P nn t = Y (k-1)P nk t 
that cannot undergo energy k-2 
1 5 transfer 

The aim of these calculations is to calculate the number of simultaneous donor de-excitation 
events for a given amount of time. This number is important because the greater the number of 
overlapping events, the smaller the percentage of donor energy transfer. For example it is possible 

20 to find the probability of two donor emissions (k=2) simultaneously occurring in one nanosecond 
for four fluorophores (n=4; thus P n ,k=P 4 ,2). Each emission, in this example, lasts for one nanosecond 
and are represented as blue circles in the grids. In addition, each fluorophore emits an average of 
25,000 photons/second randomly. The variables are defined as n = number of fluorophores, k = 
number of simultaneous donor emissions, P = probability, C = combination, and t = time. The 

25 probability for k simultaneous events occurring in one ns for n fluorophores (P n k ) is equal to the 
probability for k simultaneous events (P k ) multiplied by the number of possible combinations that 
k simultaneous events can occur for n fluorophores (nQ). The total number of simultaneous events 
is given as the sum of probabilities for all possible values of simultaneous events. This gives the 
probability in the units of (number of events)/ns. The total number of fluorophores undergoing 

30 simultaneous emission is thus the probability for one ns multiplied by the given time (t). It is also 
necessary to consider the probability of greater than two or more donor emissions (P 4 3 and ? 4 4 ) 
occurring in a given time (t). From this information, the number of donor fluorophores that cannot 
undergo energy transfer because of simultaneous de-excitation can be derived. 

The probability for two simultaneous donor emissions in one nanosecond for four 
35 fluorophores is given as: 
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The probability that one ns will contain one emission is 25,000/1 0 9 . The probability that two 
5 emissions will occur in the same nanosecond is the square (25,000/1 0 9 ) 2 . Since there are four 
different fluorophores, there are 6 different possible combinations that give rise to simultaneous 
emission (4!/2!2!). These values multiplied give 3.75 x 10~ 9 events/ns. This value multiplied by 10 9 
ns yields 3.75 events/s. In this case, there are two emissions/event (k=2) and one of these (k-1) 
cannot undergo energy transfer so the total number of donor emissions that do not undergo FRET 
10 is 3.75 events/s. 

The above simple calculation can be applied to a system with a greater number of 
fluorophores. In this case, one thousand fluorophores are considered (n=l 000). In a similar fashion, 
P, 0002 is calculated to be 3.122 x 10* 4 events/ns. The number of donor emissions that cannot undergo 
energy transfer is 3.122 x 10\ In some cases, three simultaneous donor emissions can occur. The 
15 probability for these events (P| 0 ocu) is calculated: 

= ( 25000 } 3 ( J000!_ } = 2 596 x 1Q _ 6 
10 9 3!997! 

The number of donor emissions that cannot occur is (3-l)(10 9 )(2.596xl0 6 ). or 5.192 x 10 3 . In a 
similar fashion, calculations for four simultaneous emissions or greater need to be included in the 
net number of donor fluorophores that cannot undergo energy transfer. This number is expressed 
as the following equation: 

20 k-n 

= (2-l)P n . 2 t = (3-l)P nJ t + ... +(n-l)P M t - E (k-l)P n k t 

k-2 

Solving the above summations yields 3.174 x 10 5 donor emissions that do not undergo 



BNSDOC1D: <WO 993501 2A2 t > 



WO 98/35012 PCT7US98/03024 

-38- 

energy transfer. The donor emission is calculated from the following values: 25 000 photons/s per 
fluorophore, 1 000 fluorophores, and a 1 second time interval. This means that 2.5 x 1 0 7 photons are 
emitted per second. The conclusion is that 98.7% of donor events can undergo energy transfer. 

The conclusion from the above discussion of a multiple donor/single acceptor system is that 
5 one acceptor can undergo energy transfer with multiple donors. The limitations that were considered 
were the ability to excite an acceptor to higher energy levels and also the limitations of simultaneous 
donor de-excitation. In donor emissions that occur concurrently, only one of the de-excitations 
would be able to undergo energy transfer with the acceptor. If this occurred often, then the acceptor 
would not be able to undergo energy transfer with all donor emission events. Probability 
10 calculations show that for a one thousand donor/one acceptor system under standard epiillumination 
conditions, 98.7% of donor emission events can undergo energy transfer with an acceptor. The 
overall conclusion from these detailed analyses is that a multi-donor system/one acceptor system can 
indeed exist and that a single acceptor can undergo energy transfer with a large number of donor 
molecules. 

15 The following description sets forth how conditions can be optimized for the donor and the 

acceptor fluorophores of the multi-donor/one acceptor system. In order to optimize the system two 
potential problems which may arise when this system is actually performed should be considered. 
These are photobleaching and solvent quenching (Rost, 1990; Menter et al., 1978; Giloh and Sedat, 
1982; Vaughan and Weber, 1970; Guilbault, 1973; Udenfriend 1962: Pringsheim, 1963). These 

2o effects may limit the number of excitation cycles a fluorophore can undergo. Whereas in an ideal 
system a fluorophore can undergo infinite excitation cycles, in practice, the actual number of cycles 
is limited to a finite number depending on both the solvent conditions and intensity of the excitation 
light. The signal produced in a multiple donor/one acceptor system, however, can be optimized 
under certain experimental conditions. / 

25 The theory for optimizing and generating an efficient signal follows. The rationale is that 

as long as model conditions are simulated as close as possible, signals are generated. Recall that 
ideal fluorophores can undergo infinite cycles of excitation. If realistic fluorophores behave in a 
similar fashion, then any number of fluorophores can be detected as long as temporal integration of 
photonic emissions is sufficient. One method for achieving close to ideal conditions is by 
• 30 immobilizing fluorophores in a solid medium. Embedding fluorophores in solid media can abolish 
both photobleaching and quenching (Haughland, 1996; Garland and Moore, 1979; Rost, 1991) and 
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that certain types of fluorophores are especially stable (such as ferulic acid in plant cell walls, uranyl 
glass, and Acriflavine) (Rost, 1991). This will dramatically increase the number of possible 
excitation cycles. Despite being embedded in a solid medium, donor fluorophores can generate 
fluorescent signals for several hours (Rost, 1991) and can still undergo FRET with acceptors (Stryer, 
5 1 978; Cantor and Schimmel, 1980) and vise versa. Furthermore it is known that the acceptor can 
undergo resonance energy transfer with donors for several hours in solvent conditions (Wittwer, 
1997; Lee et ah, 1994; Uchiyama et al., 1996; Livak et aL, 1997). The ability for an acceptor to 
quench a donor that has a stable signal translates into the ability to generate highly efficient signals. 

It has been shown that an acceptor can undergo energy transfer with a donor for an extended 

10 period of time in solvent conditions (Wittwer, 1997; Lee et al., 1 994; Uchiyama et al., 1 996; Livak 
et ah, 1997). The evidence arises from experiments done on peptide and nucleic acid cleavage 
assays. In these assays, a particular substrate is labeled at either end with a donor and an acceptor 
fluorophore. Since the length of the substrate is within range of the Forster distance of the donor- 
acceptor pair, the fluorescence of the donor is quenched. Monitoring the donor fluorescence allows 

15 a light based assay of cleavage. In this manner, an increase in donor fluorescence is directly 
proportional to the cleavage activity of the particular enzyme. Assays of this type, commonly called 
fluorometric cleavage assays, have been used to study a number of systems including HIV proteases 
(Matayoshi et al., 1990; Toth and Marshall, 1990), neutral proteases (Ng et al., 1989), EcoRV 
restriction endonucleases (Erskine and Halford, 1994), PacR7 endonuclease (Ghosh et al., 1994), 

20 DNA polymerase I 5'-3' cxonuclease activity (Wittwer et aL, 1997; Livak, 1997), thermolysin (Yang 
and Van Wart, 1994), and papain (Garcia-Echeverria and Rich, 1992). The relevance of these 
experiments is important because they contain evidence that an acceptor can quench a donor for 
extended periods of time. The controls in fluorometric cleavage assays include the monitoring of 
uncleaved substrates (in the absence of the cleavage enzymes) for the duration of the experiment. 

25 For especially lengthy experiments, these controls need to be monitored for an extended period of 
time. The lack of increase in donor fluorescence in these controls demonstrates directly that an 
acceptor can quench a donor for a significant length of time. 

The acceptor maintains its ability to transfer energy for an extended period of time because 
its energy states remain unchanged. Energy transfer occurs when the acceptor absorption energies 
30 are coincident with the donor emission energies. The result of energy transfer is a ground singlet 
state (A a ) to excited singlet state (A b ) conversion. The acceptor loses energy through internal 
conversion, either fluorescent or non-radiative. The studies cited above demonstrate that despite 
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non-ideal experimental conditions, the acceptors maintain their relative energy levels and hence their 
ability to undergo energy transfer. 

The number of excitation cycles that a fluorophore generally undergoes in a solution is 
roughly 35,000 cycles (Rost, 1991). The number of fluorophore excitation cycles in solution, 

5 however, is limited by the solvent effects discussed above such as oxygen quenching, collisional 
quenching, and excited state reactions. Solvent conditions can be adjusted so that the number of 
excitation cycles is optimized (Haughland, 1996). Various chemical methods are used to optimize 
conditions for detection of acceptor fluorescence. Sodium azide (NaN 3 ), sodium iodide (Nal), 
dithiothreitol (DTT), dithioerythritol (DTE), sodium dithionate, ^-propyl gallate, ascorbic acid, and 

10 polyvinyl alcohol (PVA) all have been found effective with various fluorophores (Bock et ak, 1985; 
Johnson et ah, 1982; Picciolo and Kaplan, 1984; Gill, 1979; Giloh and Sedat, 1982; Valnes and 
Brandtzaeg, 1985). p-mercaptoethanol, sodium nitroprusside, and incorporation of electron donors 
and molecules with SH groups have also been found to be effective (Franklin and Filion, 1985; Spatz 
and Grabig, 1983; Hamada and Fujita, 1 983). In addition, commercial reagents for reducing solution 

15 fluorophore fading are available. SlowFade (Molecular Probes, OR) formulation reduces the fading 
rate of fluorescein to almost zero. Because it provides a nearly constant emission intensity from 
fluorescein, the SlowFade reagent is especially useful for quantitative measurements and 
applications that employ a confocal laser scanning microscope, in which the excitation intensities 
can be extreme and prolonged. For a system under the proposed optimized conditions, both initial 

20 donor emission and the change in donor emission in the presence of acceptors have efficiencies close 
to that of ideal conditions. 

A "detectable signal" as used herein is any type of signal which can be sensed by 
conventional technology. The signal produced depends on the type of agent or station as well as the 
unit and the proximate compound if present. In one embodiment the signal is electromagnetic 

25 radiation resulting from light emission by a labeled (intrinsic or extrinsic) unit of the polymer or by 
the proximate compound. In another embodiment the signal is fluorescence resulting from an 
interaction of a radioactive emission with a scintillation layer. The detected signals may be stored 
in a database for analysis. One method for analyzing the stored signals is by comparing the stored 
signals to a pattern of signals from another polymer to determine the relatedness of the two 

30 polymers. Another method for analysis of the detected signals is by comparing the detected signals 
to a known pattern of signals characteristic of a known polymer to determine the relatedness of the 
polymer being analyzed to the known polymer. Comparison of signals is discussed in more detail 
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below. 

More than one detectable signal may. be detected. For instance a first individual unit may 
interact with the agent or station to produce a first detectable signal and a second individual unit may 
interact with the agent or station to produce a second detectable signal different from the first 

5 detectable signal. This enables more than one type of unit to be detected on a single polymer. 

The detectable signal is produced at a station. A "station" as used herein is a region where 
a portion of the polymer to be detected, e.g. the unit, is exposed to, in order to produce a signal or 
polymer dependent impulse. The station may be composed of any material including a gas. 
Preferably the station is a non-liquid material. "Non-liquid" has its ordinary meaning in the art. A 

10 liquid is a non-solid, non-gaseous material characterized by free movement of its constituent 
molecules among themselves but without the tendency to separate. In another preferred embodiment 
the station is a solid material. In one embodiment when the unit interacts with an agent the station 
is an interaction station. The station may also be a signal generation station, which is discussed in 
more detail below. As used herein an "interaction station or site" is a region where a unit of the 

15 polymer and the agent can be positioned with respect to one another in close enough proximity 
whereby they can interact. The interaction station for fluorophores, for example, is that region where 
they are close enough so that they energetically interact to produce a signal. 

The interaction station in a preferred embodiment is a region of a nanochannel where a 
localized agent, such as an acceptor fluorophore, attached to the wall forming the channel, can 

20 interact with a polymer passing through the channel. The point where the polymer passes the 
localized region of agent is the interaction station. As each labeled unit of the polymer passes by 
the agent a detectable signal is generated. The agent may be localized within the region of the 
channel in a variety of ways. For instance the agent may be embedded in the material that forms the 
wall of the channel or the agent may be attached to the surface of the wall material. Alternatively 

25 the agent may be a light source which is positioned a distance from the channel but which is capable 
of transporting light directly to a region of the channel through a waveguide. An apparatus may also 
be used in which multiple polymers are transported through multiple channels. These and other 
related embodiments of the invention are discussed in more detail below. The movement of the 
polymer may be assisted by the use of a groove or ring to guide the polymer. 

30 Other arrangements for creating interaction stations are embraced by the invention. For 

example, a polymer can be passed through a molecular motor tethered to the surface of a wall or 
embedded in a wall, thereby bringing units of the polymer sequentially to a specific location, 
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preferably in interactive proximity to a proximate agent, thereby defining an interaction station. A 
molecular motor is a compound such as polymerase, helicase, or actin which interacts with the 
polymer and is transported along the length of the polymer past each unit. Likewise, the polymer 
can be held from movement and a reader can be moved along the polymer, the reader having 

5 attached to it the agent. For instance the agent may be held within a scanning tip that is guided along 
the length of the polymer. Interaction stations then are created as the agent is moved into interactive 
proximity to each unit of the polymer. 

Once the signal is generated it can then be detected. The particular type of detection means 
will depend on the type of signal generated which of course will depend on the type of interaction 

10 which occurs between the unit and the agent. Many interactions involved in the method of the 
invention will produce an electromagnetic radiation signal. Many methods are known in the art for 
detecting electromagnetic radiation signals, including two- and three-dimensional imaging systems. 

Three-dimensional imaging systems for imaging fluorescence in biological systems has 
recently been described. Dickson et. al., have described three-dimensional imaging of single 

15 molecules in the pores of a gel. Dickson et. al., Science, 274:966-969 (1996). Dickson et al. 
examine the effects of confined environments on Brownian motion of single fluorescent molecules. 
Single molecules of free fluorescent dye and fluorescent dye bound to protein were trapped in 
polyacrylamide (PAA) gels. Nile red was incorporated into polyacrylamide gels having pores on 
the order of 2 nm. The fluorophore gels were excited with an evanescent wave generated by total 

20 internal reflection (TIR) to detect movement or immobilization of single molecules within the pores 
of the gels. The TIR method produces three-dimensional information about the molecules in the 
pores by using the exponential fall-off in excitation intensity to determine the distance of the 
fluorophore from the interface. The discrepancy between observed motion and theoretical Brownian 
motion was studied by both cases. The Brownian motion for free fluorophores was reduced by a 

25 factor of 10 4 . The labeled proteins remained completely stationary in space. By direct analogy, the 
spatial confinement of the nanochannels should limit or eliminate the Brownian motion of the 
labeled DNA in nanochannel FRET sequencing. This would allow a stable and predictable passage 
of the DNA through the nanochannels. 

An example of a microscopy system useful according to the methods of the invention is 

30 provided in Ishijima et al., Cell, 92:161-171 (1998). 

Two-dimensional imaging systems are important because they have, among other parameters, 
low noise, high quantum efficiency, proper pixel-to-image correlation, and efficient processing 
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times. An example of a device- useful for detecting signals is a two-dimensional fluorescence 
imaging system which detects electromagnetic radiation in the fluorescent wavelength range. 

There are three categories of fluorescence imaging devices based on the type of fluorescence 
signal measured, either intensity, lifetime, or spectra. Intensity signals can be captured by a variety 

5 of methods including charge coupled device (CCD) camera, streak cameras, and silicon diode arrays. 
Of these methods, the most common is the CCD camera because of its wide commercial 
applications. The streak camera offers a superior temporal resolution down to the femtosecond. 
Silicon diode arrays have superior dynamic range, signal to noise ratios, and temporal resolution 
(common frame rates at 1000/s), but have larger picture elements (25 jiM x 500 jaM as opposed to 

10 20 x 20 |iM for a CCD). Each of the devices may be used with the methods of the invention but 
CCD cameras are preferred. 

Lifetime and spectral imaging are performed using a combination of instruments including 
gated image intensifiers, pulsed lasers, and CCD cameras. Time-gated methods, which are lifetime- 
related techniques, involve temporally discriminating fluorescence signals from background and 

15 auto fluorescence. Periasamy et. al., which is hereby incorporated by reference provides a review of 
Time-gated fluorescence microscopy for clinical imaging. Periasamy et. al., Microscopy and 
Analysis. 33-35 (1995). Lifetime imaging uses time-gating or phase-modulation techniques to 
determine fluorescence lifetime. Spectral imaging determines the emission spectrum at each pixel. 
Time-gated and lifetime imaging techniques offer many advantages to simple CCD imaging such 

20 as belter signal-to-noise ratios and greater dye specificity and thus also are preferred devices for the 
detection method of the invention. 

The first type of imaging technique, intensity imaging generally involves the use of a CCD 
or ICCD camera to independently capture intensity signals. This is a desirable method for detecting 
signals according to the methods of the invention because of its simplicity. CCD and ICCD cameras 

25 can be readily purchased from a number of suppliers (i.e. Photometries, Hamamatsu, Princeton 
Instruments). 

CCDs are two-dimensional silicon matrices that have many light sensitive elements called 
pixels that can hold electronic charges generated by photon interaction. Exposure of the CCD to 
photonic fluorescent emission causes accretion of charge in the individual pixels. After an exposure 
30 is complete, entire rows of accumulated charge are transported towards a serial CCD register. In the 
serial register, individual pixel charge packets are transported to a read-out amplifier that creates a 
signal proportional to the amount of charge. Each row of the CCD is read in a similar fashion until 
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the image is successfully converted to a two-dimensional series of signals. Charge transfer between 
the CCD and the serial CCD register is extremely efficient (99.9999% efficient). The output 
amplifier creates a linear response to the measured analog signals and the image is digitized between 
8 and 16 bits. 

5 Upon read-out of the image, there are two options that CCD cameras usually employ to 

modify images. These are subarray read-out and binning, or charge-grouping. A programmed CCD 
camera can selectively process pixels in a defined region of interest. The advantage of defining 
regions of interest is that it allows faster image read-out to occur. The time to process and digitize 
each pixel is fixed, so that smaller desired regions allow for higher frame rates. Smaller regions are 

10 possible from subarray read-out. For instance a 100 x 1 00 pixel in a 1024 x 1024 pixel CCD may 
be read at a frame rate of 100 frames/s at a pixel read frequency of 1 MHZ. Binning, which is a 
combination of smaller pixels into one effective larger pixel for faster readout times, on the other 
hand, allows combination of charge from several pixels. During binning, the CCD operates at 
reduced resolution for increased dynamic range and a higher frame rate. The dynamic range is the 

15 ratio of the largest signal a CCD can detect linearly to the basal readout noise. For example, a 1024 
x 1024 pixel CCD binned 4x4 yields a 256 x 256 image which has large pixels that are effectively 
16 times larger than the unbinned version. The image is also read-out at 16 times the rate. The 
specific operational mode of the CCD is dictated by the components of the camera and a host 
computer. 

20 To understand the control of the CCD, there are three components of a typical CCD that need 

to be discussed, including the camera controller, electronics unit, and camera head. The controller 
acts as the intermediary processor of input and output between the host computer and the camera. 
The controller contains logic which causes the camera to act on certain host commands. In addition 
it also relays digitized pixel data to the computer. Internally, the controller has sequences necessary 

25 for coordination of the CCD phases and timing of the analog processing units. The electronics unit 
convert digital commands into active clocking signals and sequences. Furthermore, the unit contains 
the A/D converter. Digitizers from 8 to 16 bits are commonly used. The camera head contains the 
CCD and often a cooling device. Liquid nitrogen or Peltier cooling are common. 

With a basic understanding of the operation of a CCD camera, performance is the most 
30 important consideration for the camera. Specifically, these parameters are noise, linearity, 
quantum efficiency, and temporal resolution. There are four components of noise that are 
important. They are dark current, read noise, shot noise, and lamp noise. Dark current is the 
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leakage current within the CCD. The charge accumulates even in the dark, hence the term. The 
lower the CCD temperature, the lower the dark current. Thermoelectric coolers or liquid nitrogen . 
can lower the CCD temperature to reduce the dark current. Temperatures of -120°C can reduce 
the dark current by several orders of magnitude. Multi-pin phasing (MPP) is a new CCD 

5 technology that can reduce dark current by a factor of 1 00 or more. Read noise is the stochastic 
electronic charge generated at higher frame rates arising from the output preamplifier. As the 
collection rate increases, so does the read noise. Read noise can be significantly reduced when 
the temperature is lowered below -60°C. Shot noise, also commonly called photonic noise, 
arises because of the quantum nature of light. It is the square root of the photonic signal. At low 

10 light levels, the relative shot noise is high and often masks the desired signal. Lamp noise is due 
to intensity fluctuations in the illuminating source and can be controlled by using stable power 
supplies. Lamp noise is very often overlooked in quantitative applications of CCD cameras. 

Photometric linearity becomes especially important during quantitative fluorescence 
microscopy. For use in the methods of the invention, concern for linearity is not overly 

15 important because in general linear operators are not being applied to the CCD data. Since the 
charge generation mechanism of a CCD is intrinsically linear, the output signal is precisely 
proportional to the charge. The electronics between the CCD and the digitizer provide room for 
deviations. High scientific grade CCDs usually exhibit linearity with less than 0.1% deviation. 
In most cases, the linearity is difficult to measure because it is more linear than techniques used 

20 to test it. 

Quantum efficiency is the fraction of photons that reach the CCD that interact to generate 
electron-hole pairs in the CCD. Quantum efficiencies range from 10% on low-grade CCDs to 
as high as 80% on back illuminated scientific grade CCDs. There are also spectral variances of 
the quantum efficiency, with peaks occurring usually in the visible wavelength. Back 
25 illuminated CCDs have much greater quantum efficiencies at all spectral positions. 

Temporal resolution of a CCD camera becomes important when discussing the possible 
frame rate of the camera. Exposure times vary for given applications and the exposure time can 
be adjusted by the number of fluorophores used and the intensity of the excitation light. The 
limitation to the temporal resolution in a given CCD camera is a function of the analog-to-digital 
30 converter operating frequency and also the image size. For example, an A/D converter operating 
at 1 MHZ can read 100 frames of a 100 x 100 image in one second. In addition, the higher the 
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dynamic range desired, the slower the read frequency. For example, it is difficult for current 
CCD cameras digitizing at 14 bits per pixel to operate above 1 MHZ. Use of multiple serial 
register devices overcomes the temporal limitations imposed by one AID converter. The speed 
increase with such a device is proportional to the number of available output channels. A 14 bit 
100 x 100 pixel image acquired by a CCD camera with 4 output registers operating at 1 MHZ 
can acquire 400 frames per second. The maximum rate limitation that cannot be overcome is 
dictated by the pixel read-out times, which vary from 20 |as to 500 ns. For a 100 x 100 pixel 
image, even the slowest pixel read-out time can allow 500 frames/s. Hence, read-out times are 
rarely a limitation on the frame rate, but rather it is the A/D converter that is limiting. As 
evidence of this, frame rates of 2000/s have been possible with a 128 x 128 array (Ichikawa et 
al., 1992). 

A CCD camera which is particularly useful in the methods of the invention is a large 
pixel, low noise, and short scan time camera. Large pixel sizes have greater well capacities and 
allow for greater collection of photons and hence a greater maximal signal-to-noise ratio (SNR). 
The basis of intensity change discrimination relies on a large SNR, as is to be discussed shortly. 
Accordingly, low noise aids in improving the SNR. MPP type CCDs with smaller well capacity 
and lower dynamic range are not well suited. The temporal resolution is high from the short scan 
time, allowing a high volume data stream that should be storable in real time in the computer 
RAM or on the hard drive. The quantum efficiency should be reasonable. The linearity of the 
CCD is not critical, but assumed to be scientific grade (0.1%). The dynamic range is not critical 
because the donor fluorophores that are images are approximately the same intensity. 

An intensified CCDs (ICCDs) camera consists of a photocathode, microchannel plate 
(MCI*), and a phosphor screen in addition to the CCD camera. Fluorescence light impinges on 
the photocathode, releasing photoelectrons into the MCP. The MCP is a secondary electron 
multiplier consisting of an array of millions of glass capillaries (channels) fused into the form 
of a thin disc. When photoelectrons are incident upon the channels, secondary electrons are 
produced. Passage of the electrons through the channels produces an amplification that is 
directly controlled as the gain of the instrument. Each channel of the MCP corresponds to a 
picture element. The output of the MCP is focused onto a phosphor screen where electrons 
exiting from the MCP strike. The optical image is reproduced on the phosphor screen and is 
captured by a CCD camera. 



WO 98/35012 PCT/US98/03024 

-47- 

In some instances, it is preferred to use a ICCD. The addition of the image intensifier 
offers high speed gating and high gain. Gating is probably the most important aspect of the 
ICCD. It is the electronic shutter action produced by controlling the input voltage to the image 
intensifier. At present, time-resolved imaging methods are possible with nanosecond and sub- 

5 nanosecond gating times (Thomas et al., 1992). Gating allows for greater signal detection 
because of temporal discrimination of background signals. Examples of temporal discrimination 
and enhancement of signal-to-noise ratios is seen in time-gated fluorescence microscopy and the 
pulse method of fluorescence lifetime imaging microscopy. The gain of the image intensifier is 
used to increase the sensitivity of the camera and also serves as a tool in the phase-modulated 

10 method of fluorescence lifetime imaging spectroscopy. The electronic gain can be as high as 
1 0,000: 1 . This allows faint optical signals to be amplified over read noise. 

Single photon events can be detected with an ICCD. Single fluorophore imaging, for 
instance, has been achieved by Sase et al., 1995 using a CCD camera, image intensifier, and an 
epifluorescence microscope. In "Real Time Imaging of Single Fluorophores on Moving Actin 

15 with an Epifluorescence Microscope," Sase et al. demonstrate that single fluorophores can be 
imaged in real time with a high detection efficiency. Other methods that have achieved solution 
single molecule sensitivity include fluorescence correlation spectroscopy (Eigen and Rigler, 
1994; Kinjo and Rigler, 1995), far-field confocal microscopy (Nie et al., 1994), cryogenic 
fluorescence spectroscopy (Kartha et al., 1 995), single molecule photon burst counting (Haab and 

:ti Mathies, 1995; Castro and Shera, 1995; Goodwin et al., 1995; Peck et al., 1989; Nguyen and 
Keller 1987; Lee et al., 1994; Chen and Dovichi, 1996; Shcra ct al., 1990; Sopcr et al.. 1992), 
two-photon excited fluorescence (Mertz, 1995), and electrochemical detection (Fan and Bard, 
1995). 

A numerical SNR value can be calculated based on the desired aspects of a CCD camera. 
25 For example, a CCD that satisfies the criteria set out above is an EEV 05-20 CCD (Princeton 
Instruments, (Princeton, NJ) which has the specifications set forth in Table 1 : 



Table 1 


CCD format 


1152x770 


dynamic range, bits 


14 to 17 


pixel size, pm 


22.5x22.5 


non-linearity, % 


<1 (16 bits) 


full well capacity, ke* 


500 


dark charge at-120°C, 
electrons/pixel-hour 


<1 



BNSDOCID <WO 



98350 12A2 I > 



WO 98/35012 




-48- 


PCT/US98/03024 




Table 1 


readout noise, e' 


4-6, 50 kHz 
22,500 kHz 


quantum eff., %peak 


40 



In order to calculate a SNR, an equation that considers the various noise values is needed. 
In any detection system, there is always a basal level of noise that may hinder detection of signals 
5 that are weak and intermittent. Intuitively, the larger the desired signal, the less important the 
basal level of noise becomes. A larger signal and hence a larger SNR can be attained by 
increasing either the number of fluorophores or the excitation intensity. The equation for 
calculating the SNR is 



SNR = l!L 



PE = ^P tENG 
hv 

PE represents the number of photoelectrons emitted from the detector and is related to the efficiency 
10 (E) of the detector system. The higher the efficiency (E), the greater the number of photoelectrons 
emitted for a given number of photons. PE is also related to the number of fluorophores (N), pre- 
amplifier gain (G), integration time (t), intensity of light (I), molar extinction coefficient of the 
fiuorophorc (c), and a fluorescence constant specific for the chosen fluorophore (p). N shot represents 
the noise due to random fluctuations of the fluorophore emission and is related to the magnitude of 
15 the signal produced. The larger the signal, the larger the shot noise. N dark and N rcad are dark noise 
and read noise. N lamp is the noise generated from the illuminating source. It is important to note that 
N jamp can have large influences in the SNR. An illumination source that has a 1% intensity 
fluctuation can have a SNR only as large as 100:1. In this case, the lamp noise is minimized by 
using a stable power supply. For example, commercially available (Uniphase) helium-neon lasers 
20 with modified power supplies have intensity fluctuations of less than 0.002% of the total intensity. 
Furthermore, a tungsten filament lamp can be equipped with a stable power supply so that the output 
of the bulb fluctuates less than 0.001%. Table 2 lists the values for the variables and provides the 
reason the particular values were chosen. 
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Table 2 


variable 


value 


reason 


1 


30 W/cm 2 


intensity of 2 W laser is given by P/A. A is the beam area. Intensity 
of laser is 64 W/cm 2 


e 


91,000 1/M cm 


molar extinction coefficient for fluorescein 


P 


3.8x lO* 21 M cm 3 


constant for fluorescence emission of fluorescein 


t 


0.010 s 


integration time for CCD operating at 100 frames/s 


E 


0.025 


collection efficiencyof system including filters, objective, and 

niiQntiimpffif ipn^i/ f\C 
LJUdMllill 1C J 1 ILlCflLy Ul V^V_,i_7 


N 

IN 


j ooo 


ICaoUllaulc llUMlUd Ul UUI IvJI 1 1 UUI Upl IUI go/ pi aCI 


G 


60 


tvnir^l nrp-^mnlifipr n^iir nam fnr If^C^n ran hp a«; hinh 10 000*1 


h 


6 6261 x 10" 34 J s 


Planck'*; constant 


v 


6.1 224x1 0 14 x S" 1 


c = vA" A = 490nm (excitation of fluorescein^ 




632 e- 1 


N shol =PE 




8e' 1 


0.002% intensity fluctuation 


Head 


40 e" 1 


estimation for EEV 05-20 CCDat high readout speeds 


N d ark 


-0 


value for EEV 05-20 CCD at -120°C 


SNR 


631:1 


from the above equation 



A SNR of 631:1 is high, A high SNR ensures efficient detection of both intensity and 
intensity changes, or in other words, donor emission in the absence and presence of an acceptor. For 

20 example, a SNR of 63 1 : 1 means that there is a 66% confidence of detecting a 0. 1 58% (1/632 x 1 00) 
change in the signal. There is a 95% confidence in detecting a 0.316% change and a 99.9% 
confidence for a 0.475% change. The larger the intensity change, the greater chance of detection. 
A higher baseline SNR in the absence of any intensity changes allows for greater confidence 
intervals for a given percentage intensity change. In order to maximize the confidence of detection 

25 of a signal change, it is important to generate large percentage changes in the presence of acceptors. 

One method for generating a large percentage change is to cluster multiple donor 
fluorophores around the interaction station, e.g., in a concentric ring which the polymer can pass 
through, ensuring that all donor fluorophores will undergo energy transfer with the acceptor. An 
appropriate width of such a concentric ring of donor fluorophores can be determined by the rate 

30 of emission of the donors and also the Forster distance of the donor-acceptor pair. Typical 
changes in intensity upon acceptor passage range from 30% to 50%, which correspond to an 
unequivocal 100% confidence. 
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Before the CCD camera can process a signal each signal generated must be captured by 
pixels of the detector system. Each pixel should be capable of capturing a signal from a single 
interaction station and should have the ability to detect transient changes in the signal. The area 
of an interaction station, e.g., the localized region of agent on a nanochannel, detected by one 
5 pixel is determined by the pixel size on the detector, the magnification of the image, and also the 
diffraction limit of the wavelengths to be measured. The relationship between the measured area, 
detector pixel size, and magnification is given by the following equation. 



a = IdIM 



The size of the area to be measured is given by a 2 where a is the edge length. The edge length 
10 of the detector pixel is given by d. The magnification is given by M. Using conventional values 
of these variables, with d = 1 5 fiM and M = 60x, the edge length of a measured area turns out to 
be 500 nm, well within range of the resolving power of a 500 nm wavelength signal. The 
resolving power for a microscope sample under epiillumination is presented in the following 
equation (Matsumoto, 1993). 

R f = 0.61X I NA 



1 5 R. is the minimum distance between two bright points that can be resolved. X is the wavelength 
of the donor fluorescent emission. NA is the numerical aperture of the microscope objective. 
The highest numerical aperture is desired because of the inverse relationship between NA and 
the resolving power. NA is given by the following classic criterion for lens selection (Taylor 
and Salmon, 1989): 



NA ~ nsinS 

20 n is the index of refraction of the immersion medium. It is often desirable to use a higher index 
medium such as oil. 0 is the angle between the optical axis and the greatest marginal ray entering 
the lens. For quality microscope objectives at high powers, the numerical aperture can be as high 
as 1 .4. From the above two equations, the resolving power for a 500 nm emission signal and a 
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1 .4 N A lens becomes 2 1 8 nm. 

The conclusion from the above calculations is that two adjacent pixels on the detection 
unit can each detect signals from their respective interaction stations without confusion of the 
origins of the signals. By carefully calculating the magnification and the pixel size cross- 
5 interference between pixels can be avoided. For example if the smallest pixel size (d = 6 jiM) 
and the largest magnification (M = lOOx), were used a 120 nm edge length on the detected area 
is achieved. Since the resolving power of the system remains constant at 21 8 nm, interference 
of signals would exist. This can be avoided by performing calculations prior to defining the 
experimental set up. Hence, by adjusting the magnification and the pixel size on the detection 
) 0 system, an optimal number can be reached where the measured area matches the diffraction limit. 
This can be determined from a combination of the above two formulas by setting the edge length 
equal to or greater than the resolving power of the detection system: 

IdIM > Q.GlXlNA 

the density of the interaction stations can be varied so that only one interaction station 
corresponds to one pixel of the detector system. For example, if a 4 x 10 6 nanochannels/cm 2 
15 plate is used, 0.01 nanochannels (interaction stations) are found per pixel using a 60x 
magnification and a 15 \xM pixel size. The interaction station density can be adjusted even 
lower, e.g., an average minimum inter-pore distance of 48 \\M is also within the appropriate 
range. 

In addition to intensity imaging both time-gated fluorescence microscopy and 
20 fluorescence lifetime imaging may be used to detect signals according to the methods of the 
invention. Time-gated fluorescence microscopy and fluorescence lifetime imaging are more 
involved methods that have advantages such as temporal discrimination of fluorescent signals 
and better signal-to-noise ratios than intensity imaging. 

Time-gating is desirable if there is significant background scattering or autofluorescence. 
25 Background light scattering is a problem when the scattered wavelengths are equal to the 
emission wavelengths of the desired fluorophores. Scattering effects can be avoided by using 
a fluorophore with a large Stokes shift so that the scattered wavelengths are shorter than the 
detected wavelength. In this case, the scattered light can be eliminated by using optical filters. 



BNSDOCtD: <WO 



983501 2A2 I > 



WO 98/35012 PCT/US98/03024 

-52- 

On the other hand, autofluorescence is a common problem affecting essentially all studies 
employing fluorescence microscopy. Autofluorescence can arise from solvents, solutes, and the 
optical components of the microscope system. Autofluorescence decreases the signal-to-noise 
ratio of detection. This is the case even though there have been many improvements in the 
5 various components of the detector systems (Periasamy and Herman, 1 994). 

Time-gated fluorescence microscopy (TGFM) utilizes differing fluorescence lifetimes 
to make a distinction between autofluorescence and fluorescence. Lifetimes of fluorescent dyes 
can be chosen to be longer than that of autofluorescence. Short lived autofluorescence decays 
to zero in less than 1 \is whereas long lived fluorescence can demonstrate lifetimes from 1 (is to 
10 10 ms (ex. europium chelates, lanthanide chelates). Excitation of the sample is done with a brief 
intensity pulse, shorter than the lifetime of either the autofluorescence or fluorescence. 
Exponential decay follows. If measurements are made only after the decay of the fluorescent 
signal, then the longer lived signals are measured with a greater sensitivity. 

An example of an imaging apparatus for TGFM is provided in Periasamy, 1995. A 
15 fluorescent microscope (Nikon) with epiillumination capabilities and a continuous wave (CW) 
laser (Coherent or Spectra-Physics) are shown in the reference, emitting at the desired excitation 
wavelength. The excitation laser light is chopped by a chopper with a chopper control to create 
laser pulses with defined pulse widths. The intensity of the light is controlled using a variable 
neutral density filter (Omega Optics). To delay the time of measurement, a delay pulse generator 
20 is used to generate a signal for controlling a high frequency gated image intensifier (Mamamatsu) 
or a chopper in the emission light path. A CCD camera (Princeton Instruments or Photometries) 
is used to collect the signals. 

The fluorescence lifetime represents the average amount of time a molecule remains in 
the excited state prior to its return to ground state. There are two methods for the measurement 

25 of fluorescence lifetimes. These are the pulse method and the phase-modulation method 
(Lakowicz, 1986; McGown, 1989; Gratton and Limkema, 1983). In the pulse method, the 
sample is excited with a brief pulse of light and the time-dependent fluorescent decay is 
measured. In the phase-modulation method, the sample is excited with a sinusoidally modulated 
light. The phase shift and demodulation is used to calculate the lifetime. Until recently, lifetime 

30 measurements were only used with cuvette samples. In the past five years, there has been 
development of methods that combine microscopic two-dimensional resolution and high- 
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resolution lifetime measurements (Rodgers and Firey, 1985; Wang et al., 1990; Morgan et al., 
1990; Clegg et ah, 1991; Lakowicz and Berndt, 1991; Buurman et al., 1992; van de Ven and 
Gratton, 1 992; Oida et ah, 1 993). The development of fluorescence lifetime imaging microscopy 
(FLIM) has allowed the detailed study of location and environment of fluorescent labels in cells 
5 and other microscopic samples. In the following, the advantages, theory, and applications of 
FLIM arc discussed. 

Fluorescence lifetime measurements have been used for a variety of reasons including 
specificity, sensitivity, quantitation, and high temporal resolution (Wang et al., 1996). 
Measurement of lifetimes provide high specificity because fluorescent molecules have distinct 

10 lifetimes. In comparison to absorption and emissions, lifetimes provide greater discrimination 
of molecules. Lifetimes can also be carried out on small amounts of molecules, leading to 
similar sensitivities as intensity measurements. Quantitation of molecules through lifetimes 
provide a true measurement because fluorescence lifetime is directly related to the fluorescence 
quantum yield of the fluorophore. Lastly, lifetimes can be used to detect temporal events that 

15 occur on the time scale of biomolecular processes, usually between a picosecond and a 
microsecond. 

There are two methods of determining lifetimes. The pulse method is described first 
(Lakowicz, 1986). Consider an short pulse of light exciting a population of fluorophores. The 
fluorescent signal of the excited molecules decays with time in a first order manner, given as the 
20 following exponential function: 



/(/) = Ae<-'^ 

a is an arbitrary constant, t is the time, and x is the fluorescent lifetime. Intuitively, the 
fluorescent lifetime is the time required for the intensity to decay to 1/e of the original value, a 
decay of 63%. One of the methods to experimentally measure the lifetime is to use a pulse- 
sampling method (Herman et al., 1 996). D, and D 2 are collected on consecutive frames. Frames 
25 are analyzed with a lifetime equation and a Figure 2 and 2-D lifetime array is generated. 
Following each excitation event, the multichannel plate gated image intensifier (MCP-GI1) 
attached to the CCD camera is turned on for a very brief interval (i.e. 4 ns) at some time interval 
(t,) after the exciting pulse. The emission is acquired on a CCD that is continually on. The 
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identical process is repeated for a large number of times to capture a sufficient signal on one 
frame of the CCD. After a sufficient signal at t, is generated, the CCD is read out and the gate 
window with respect to the excitation pulse (t 2 ) is shifted and the whole process is repeated. 
Interpretation of the two frames with a pixel by pixel analysis gives the lifetime of the image at 
each point. 

Wang et al., 1996 describe the apparatus for pulse FL1M as consisting of five main 
components: 1) pulsed light source; 2) image detection system (gated image intensifier and CCD 
camera; 3) timing control unit; 4) and fluorescence microscope. The system is identical to the 
apparatus described for time-gated fluorescence microscopy (TGFM) described above, except 
for the pulsed light source which is a picosecond pulsed light source (Coherent) consisting of a 
mode-locked YAG laser, a dye laser with a third harmonic generator, and a cavity dumper. 
Picosecond pulses having tunable wavelengths from UV to IR. Rates from single shot to 76 
MHZ are generated. 

The second method of fluorescence lifetime determination is by the phase-modulation 
15 method. Instead of using a pulsed light source of excitation, the method uses a light whose 
intensity is modulated sinusoidally. The emission of the sample, therefore, follows the same 
sinusoidal variations. The modulated emission is delayed in phase because of the excited 
lifetime of the fluorescent molecules. The magnitude of the phase shift (cp) is directly related to 
the lifetime of the fluorophore. Furthermore, there is demodulation of the emission. In other 
20 words, the amplitude of the final emission is smaller in amplitude than that of the excitation light. 
Both the phase angle (<p) and the demodulation factor (m = BA/ba) are measured and used to 
calculate the phase (x p ) and modulation lifetimes (xj (Lakowicz, 1986). 



t = tan <fr _ _ VKlTm 2 )-l 

P l m 



In single exponential decay, x p = x m = x, the actual fluorescence lifetime. 

The basic theory behind phase modulation lifetime determination can be applied to a two- 
dimensional imaging system. The following description is based on from Gadella et al., 1993 and 
Lakowicz and Szmacinski, 1 996, each of which are hereby incorporated by reference. The method 
uses a gain-modulated image intensifier that generates an image that has an intensity related to the 
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phase shift of the emission signal. With use of several phase-sensitive images collected with various 
electronic delays or phase shifts, it is possible to calculate the lifetime image of the object. To 
understand this further, it is necessary to present the equation that describes the time-averaged, 
phase-sensitive intensity from a certain position r: 

J(r,0-Z>) = J 0 (r)[l + V 2 m D m(r) cos(0(r) -0^)] 



5 where r denotes the pixel position, 0 D is the phase angle of the gain modulation signal, 0(r) is the 
phase of the emission, m D is the gain modulation of the detector, m(r) is the modulated amplitude 
of the emission, I c (r) is the original intensity of the pixel (which depends on concentration). The 
equation describes the intensity of a given pixel as a function of two controlled parameters (6 D and 
m D ) and three unknowns (I c (r), 9(r), m(r)). Recall that the lifetime (t) can be determined if either 

10 G(r) or m(r) is known. Since there are three unknowns, at least three different images are needed to 
determine the lifetime of the specimen. By controlling the phase angle of the gain modulation signal 
(G u ), a series of phase-sensitive images are generated and hence lifetimes can be determined. 

The apparatus for phase-modulation FLIM is described in Lakowicz and Szmacinski, 1996. 
Excitation is provided by the output of a cavity-dumped laser, which is synchronously pumped by 

15 a mode-locked Nd:YAG laser. The excitation light is expanded by a laser beam expander. The 
gated image intensifier is positioned between the target and the CCD camera. The gain of image 
intensifler is modulated using output of a frequency synthesizer. A CCD camera captures the phase 
sensitive images. A computer with FLIM software processes the output to generate a lifetime image. 
Other interactions involved in the method will produce a nuclear radiation signal. As a 

20 radiolabel on a polymer passes through the defined region of detection, such as the station, nuclear 
radiation is emitted, some of which will pass through the defined region of radiation detection. A 
detector of nuclear radiation is placed in proximity of the defined region of radiation detection to 
capture emitted radiation signals. Many methods of measuring nuclear radiation are known in the 
art including cloud and bubble chamber devices, constant current ion chambers, pulse counters, gas 

25 counters (i.e., Geiger-Miiller counters), solid state detectors (surface barrier detectors, lithium-drifted 
detectors, intrinsic germanium detectors), scintillation counters, Cerenkov detectors, etc. 

Other types of signals generated are well known in the art and have many detections means 
which are known to those of skill in the art. Among these include opposing electrodes, magnetic 
resonance, and piezoelectric scanning tips. Opposing nanoelectrodes can function by measurement 
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of capacitance changes. Two opposing electrodes create an area of energy storage, which is 
effectively between the two electrodes. It is known that the capacitance of two opposing electrodes 
change when different materials are placed between the electrodes. This value is known as a 
dielectric constant. Changes in the dielectric constant can be measured as a change in the voltage 
5 across the two electrodes. In the present example, different nucleotide bases or units of a polymer 
may give rise to different dielectric constants. The capacitance changes as the dielectric constant 
of the unit of the polymer per the equation: C=KC D , where K is the dielectric constant and C„ is the 
capacitance in the absence of any bases. The voltage deflection of the nanoelectrodes is then 
outputted to a measuring device, recording changes in the signal with time. 

1 0 A nanosized NMR detection device can be constructed to detect the passage of specific spin- 

labeled polymer units. The nanosized NMR detection device consists of magnets which can be 
swept and a means of irradiating the polymer with electromagnetic energy of a constant frequency 
(This is identical to holding the magnetic field constant while the electromagnetic frequency is 
swept). When the magnetic field reaches the correct strength, the nuclei absorb energy and 

15 resonance occurs. This absorption causes a tiny electric current to flow in an antenna coil 
surrounding the sample. The signal is amplified and output to a recording device. For known 
labeled compounds, the time of detection is much faster than current means of NMR detection where 
a full spectra of the compound in question is required. Known labeled units of polymers have known 
chemical shifts in particular regions, thereby eliminating the need to perform full spectral sweeps, 

20 lowering the time of detection per base to micro or milliseconds. 

A nanoscale piezoelectric scanning tip can be used to read the different units of the polymer 
based on physical contact of the different polymer units with the tip. Depending on the size and 
shape of the polymer unit, different piezoelectric signals are generated, creating a series of unit 
dependent changes. Labels on units are physically different than native units and can create a ready 
25 means for detection via a piezoelectric scanning tip. Upon contact of a polymer unit with the tip, 
the piezoelectric crystals change and give rise to a current which is outputted to a detection device. 
The amplitude and duration of the current created by the interaction of the polymer unit and the tip 
is characteristic of the polymer unit. 

Optical detectable signals are generated, detected and stored in a database the signals can be 
30 analyzed to determine structural information about the polymer. The computer may be the same 
computer used to collect data about the polymers, or may be a separate computer dedicated to data 



BNSDOCID' <WO 



=83501 2A2 I > 



WO 98/35012 PCT/US98/03024 

-57- 

analysis. A suitable computer system to implement the present invention typically includes an 
output device which displays information to a user, a main unit connected to the output device and 
an input device which receives input from a user. The main unit generally includes a processor 
connected to a memory system via an interconnection mechanism. The input device and output 
5 device also are connected to the processor and memory system via the interconnection mechanism. 

Nanochannels can be prepared by electroless deposition procedures which produce a metal 
fibril running the complete width of a polycarbonate template membrane. The membrane can also 
be produced such that both faces of the membrane are covered with thin metal films to produce a 
nanodisk electrode ensemble, one of the metal films can be removed to expose the surface of the 
10 membrane. The metal films can be removed to expose the surface of the membrane. These 
electrodes can be connected at their bases to a common current collector. This assembly is useful 
for examining changes in current as polymers flow through changes in conductance can be 
measured. The preparation of such plates is described in Martin, C.P.R., Science, 266:1961-1965 
(1994). 

15 Computer programs for data analysis of the detected signals are readily available from CCD 

manufacturers. Such programs may be executed using a general puipose computer such as described 
below. For the methods of the invention, only operations on single pixels need to be performed 
(point operations). The complexity of the point operations depend on the method of imaging used. 
Intensity based imaging offers the fastest manipulation of data and because only arithmetic is 

20 performed on the individual pixels. Regardless of the imaging technique (intensity, TGFM. or 
FLIM), the algorithms performed on each pixel in each method are considered low-level when 
compared to global, whole frame operations that need to be performed in certain more complex 
imaging situations. 

It should be understood that one or more output devices may be connected to the computer 
25 system. Example output devices include a cathode ray tube (CRT) display, liquid crystal displays 
(LCD), printers, communication devices such as a modem, and audio output. It should also be 
understood that one or more input devices may be connected to the computer system. Example input 
devices include a keyboard, keypad, track ball, mouse, pen and tablet, communication device, and 
data input devices such as sensors. It should be understood the invention is not limited to the 
30 particular input or output devices used in combination with the computer system or to those 
described herein. 
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The computer system may be a general purpose computer system which is programmable 
using a high level computer programming language, such as AC@. The computer system may also 
be specially programmed, special purpose hardware. In a general purpose computer system, the 
processor is typically a commercially available processor, of which the series x86 processors, 

5 available from Intel, and similar devices from AMD and Cyrix, the 680X0 series microprocessors 
available from Motorola, the PowerPC microprocessor from IBM and the Alpha-series processors 
from Digital Equipment Corporation, are examples. Many other processors are available. Such a 
microprocessor executes a program called an operating system, of which WindowsNT, UNIX, DOS, 
VMS and OSS are examples, which controls the execution of other computer programs and provides 

10 scheduling, debugging, input/output control, accounting, compilation, storage assignment, data 
management and memory management, and communication control and related services. The 
processor and operating system define a computer platform for which application programs in 
high-level programming languages are written. 

A memory system typically includes a computer readable and writeable nonvolatile recording 
15 medium, of which a magnetic disk, a flash memory and tape are examples. The disk may be 
removable, known as a floppy disk, or permanent, known as a hard drive. A disk has a number of 
tracks in which signals are stored, typically in binary form, i.e., a form interpreted as a sequence of 
one and zeros. Such signals may define an application program to be executed by the 
microprocessor, or information stored on the disk to be processed by the application program. 
20 Typically, in operation, the processor causes data to be read from the nonvolatile recording medium 
. into an integrated circuit memory element, which is typically a volatile, random access memory such 
as a dynamic random access memory (DRAM) or static memory (SRAM). The integrated circuit 
memory clement allows for faster access to the information by the processor than does the disk. The 
processor generally manipulates the data within the integrated circuit memory and then copies the 
25 data to the disk when processing is completed. A variety of mechanisms are known for managing 
data movement between the disk and the integrated circuit memory element, and the invention is not 
limited thereto. It should also be understood that the invention is not limited to a particular memory 
system. 

It should be understood the invention is not limited to a particular computer platform, 
30 particular processor, or particular high-level programming language. Additionally, the computer 
system may be a multiprocessor computer system or may include multiple computers connected over 
a computer network. 
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The data stored about the polymers may be stored in a database, or in a data file, in the 
memory system of the computer. The data for each polymer may be stored in the memory system 
so that it is accessible by the processor independently of the data for other polymers, for example 
by assigning a unique identifier to each polymer. 

5 The information contained in the data and how it is analyzed depends on the number and type 

of labeled units that were caused to interact with the agent to generate signals. For instance if every 
unit of a single polymer, each type of unit (e.g., all the A's of a nucleic acid) having a specific type 
of label, is labeled then it will be possible to determine from analysis of a single polymer the order 
of every unit within the polymer. If, however, only one of the four types of units of a polymer such 

10 as a nucleic acid is labeled then more data will be required to determine the complete sequence of 
the nucleic acid. Several labeling schemes and methods for analyzing using the computer system 
data produced by those schemes are described in more detail below. The labeling strategies are 
described with respect to nucleic acids for ease of discussion. Each of these strategies, however, is 
useful for labeling all polymers. 

1 5 Several different strategies of labeling are possible, involving permutations of different types 

of nucleotides labeled, different percentage of nucleotides labeled, and single-stranded or double- 
stranded labeling and diversity labels, such as compound which bind to a polymer having a specific 
sequence (diversity labels are discussed in more detail below relative to specific embodiments). The 
simplest labeling scheme involves the labeling of all four nucleotides with different labels. Labeling 

20 schemes using three, two, or even one label are also possible. 

A four nucleotide labeling scheme can be created where the A's, C's, G's, and T's of a target 
DNA is labeled with different labels. Such a molecule, upon traversing an interaction station, will 
generate a linear order of signals which correspond to the linear sequence of nucleotides on the target 
DNA. The advantage of using a four nucleotide strategy is its ease of data interpretation and the fact 

25 that the entire sequence of units can be determined from a single labeled polymer. Adding extrinsic 
labels to all four bases, however, may cause steric hindrance problems. In order to reduce this 
problem the intrinsic properties of some or all of the nucleotides may be used to label the 
nucleotides. As discussed above, nucleotides are intrinsically labeled because each of the purines 
and pyrimidincs have distinct absorption spectra properties. In each of the labeling schemes 

30 described herein the nucleotides may be either extrinsically or intrinsically labeled but it is preferred 
that at least some of the nucleotides are intrinsically labeled when the four nucleotide labeling 
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method is used. It is also preferred that when extrinsic labels are used with the four nucleotide 
labeling scheme that the labels be small and neutral in charge to reduce steric hindrance. 

A three nucleotide labeling scheme in which three of the four nucleotides are labeled may 
also be performed. When only three of the four nucleotides are labeled analysis of the data generated 

5 by the methods of the invention is more complicated than when all four nucleotides are labeled. The 
data is more complicated because the number and position of the nucleotides of the fourth unlabeled 
type must be determined separately. One method for determining the number and position of the 
fourth nucleotide utilizes analysis of two different sets of labeled nucleic acid molecules. For 
instance, one nucleic acid molecule may be labeled with A, C, and G, and another with C, G, and 

10 T. Analysis of the linear order of labeled nucleotides from the two sets yields sequence data. The 
three nucleotides chosen for each set can have many different possibilities as long as the two sets 
contain all four labeled nucleotides. For example, the set ACG can be paired with a set of labeled 
CGT, ACT or AGT. 

The sequence including the fourth nucleotide also may be determined by using only a single 

15 labeled polymer rather then a set of at least two differently labeled polymers by identifying the 
position of the fourth nucleotide on the polymer. This can be accomplished by determining the 
distance between labeled nucleotides on a nucleic acid molecule. For example A, C, and G are 
labeled and the detectable signals generated indicated that the nucleic acid molecule had a sequence 
of AGGCAAACG. If the distances between each of the nucleotides in the nucleic acid molecule are 

20 equivalent to the known inter-nucleotide distance for a particular combination of nucleotides except 
the distance between G and G is twice the normal inter-nucleotide distance then a T is positioned 
between the two G^s and the entire molecule has a sequence of AGTGCAAACG. The distance 
between nucleotides can be determined in several ways. Firstly, the polymer and the agent may be 
moved relative to one another in a linear manner and at a constant rate of speed such that a single 

25 unit of the nucleic acid molecule will pass the agent at a single time interval. If two time intervals 
elapse between detectable signals then the unlabeled nucleotide which is not capable of producing 
a detectable signal is present within that position. This method of determining the distance between 
bases is discussed in more detail below in reference to random one base labeling. Alternatively the 
polymer and the agent may be caused to interact with one another such that each labeled unit 

30 interacts simultaneously with an agent to produce simultaneous detectable signals. Each detectable 
signal generated occurs at the point along the polymer where the unit is positioned. The distance 
between the detectable signals can be calculated directly to determine whether an unlabeled unit is 
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positioned anywhere along the nucleic acid molecule. 

Nucleic acid molecules may also be labeled according to a two nucleotide labeling scheme. 
Six sets of two nucleotide labeled nucleic acid molecule can be used to resolve the data and interpret 
the nucleotide sequence. Ambrose et al., 1993 and Harding and Keller, 1992 have demonstrated the 
synthesis of large fluorescent DNA molecules with two of the nucleotides completely extrinsically 
labeled. The average size of the molecules were 7 kb. Six different combinations of two nucleotide 
labeling are possible using the following formula: 



/i* 4 

( c.) = — = — — = 6 

" k\{n-k)\ 2!2! 



where n nucleotides are taken k at a time. The possible combinations are AC, AG, AT, CG, CT, and 
GT. Knowledge of the linear order of the labels in each of the sets allows for successful 

10 reconstruction of the nucleic acid sequence. Using a 4-mer (5'ACGT'3) as a model sequence, the 
theory can be demonstrated. The first set, AC, gives the information that there must be a C after the 
A. This docs not give information about the number of nucleotides intervening the A and the C nor 
does it give information about any G's or T's preceding the A. The second set, AG, shows that there 
is also a G after the A. Set AT shows there is a T after the A. From these three sets, it is then known 

15 that the target DNA is a 4-mer and that one C, one G, and one T follow the A. The subsequent sets 
give information on the ordering of these three nucleotides following the A. Set CG shows that G 
follows C. Set CT shows that T follows C. Set GT finishes the arrangement to give the final 
deciphered sequence of 5'ACGT'3. In addition to the method using six labeled sets of nucleic acid 
molecules, the sequence can be established by combing information about the distance between 

20 labeled nucleotides generating detectable signals as described above and information obtained from 
fewer than six sets of two nucleotide labeled nucleic acid molecules. 

A fourth labeling scheme, the random one nucleotide labeling scheme also may be used. In 
this method, distance information which is obtained by either population analysis and/or 
instantaneous rate of DNA movement is used to determine the number of nucleotides separating two 
25 labeled nucleotides. Analysis of four differently labeled target molecules yields the complete 
sequence. 
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One method of analysis with these labeleing methods includes the use of complementary 
base information. Figure 1 demonstrates the labeling strategy in which two differently labeled DNA 
samples are required. The first sample has two of its non-complementary bases randomly labeled 
with the same fluorophore. Non-complementary pairs of bases are AC, AG, TC, and TG. The 

5 second sample has one of its bases randomly labeled. The base chosen for the second sample can 
be any of the four bases. In the example given, the two non-complementary bases are chosen to be 
A and C. As a result, two samples are prepared, one with labeled A's and C's and another with 
labeled A's. The DNA is genomically digested, end-labeled, purified, and analyzed by nanochannel 
FRET sequencing. The sequence-specific FRET information arising from each fragment is sorted 

10 into one of two complementary strand groups. Sorting allows population analysis to determine the 
positions of all the desired bases. The figure illustrates the generation of sequence information from 
the sorted data. The first group of analyzed information yields the positions of all the A's and C's 
on one strand. The second group analyzed yields knowledge of all the A's and C's on one strand. 
The same procedure is applied to the complementary stand. Knowledge of the complementary 

15 strand's A's and C's is identical to knowledge of the T's and G's on the other stand. The result is 
sequence reconstruction. To cross-verify the sequence, the process can be repeated for the other 
pairs of non-complementary bases such as TG, TC and AG. 

There are two methods of determining the distance between bases. One requires determining 
the instantaneous rate of DNA movement, which is readily calculated from the duration of energy 
2<» transfer or quenching for a particular label. Another involves analyzing a population of target DNA 
molecules and its corresponding Gaussian distance distributions. 

The instantaneous rate method, involves a determination of distance separation based on the 
known instantaneous rate of DNA movement (v) multiplied by the time of separation between 
signals (t). Instantaneous rate is found by measuring the time that it takes for a labeled nucleotide 
25 to pass by the interaction station. Since the length of the concentrated area of agent (d) is known 
(through calibration and physical measurement of the localized region of the agent, e.g.. the 
thickness of a concentrated donor fluorophore area), the rate is simply v=d/t. As shown in Figure 
2 analysis of raw data demonstrating changes in energy emission patterns resulting from sequential 
detectable signals when plotted produces a curve which from left to right shows two energy intensity 
. 30 decreases, followed by two energy intensity increases. The plateau from the first energy intensity 
decrease (denoted t,) is double that of the second plateau (t 2 ). The length of the interaction station 
is given as 51 A. From this given information, the number of labeled nucleotides is known. 



WO 98/35012 PCT/US98/03024 

-63- 

Furthermore, the distance of separation of the two is determined by relating the rate of DNA 
movement to the time of the donor intensity plateaus. 

} 

The number of labeled nucleotides is simply denoted by the number of intensity decreases. 
In Figure 2, there are two intensity decreases. Accordingly, there must be two detectable labels on 

5 the DNA. To determine the distance of base separation, it is necessary to know the instantaneous 
rate of DNA movement, which is found by knowing the time for one labeled nucleotide to cross the 
localized region of the agent and the length of the localized region of the agent. The length of the 
localized region of the agent is given as 51 A. The time for one labeled nucleotides crossing the 
localized region of the agent is bounded by the first intensity decrease and the first intensity increase 

10 (denoted as the gray shaded region, 7.5 s). The rate of DNA movement is 6.8 A/s. The base 
separation is derived from the time separating the labeled nucleotides (t, = 5 s) multiplied by the rate 
(6.8 A), which is equal to 10 base pairs. As a means of cross- verification, 51 A - t 2 v also yields the 
base separation. 

In the population method the entire population of labeled nucleotide is considered. 
15 Knowledge of the length of the localized region of the agent and instantaneous rate, as required for 
the rate method, is not necessary. Use of population analyses statistically eliminates the need for 
precision measurements on individual nucleic acid molecules. 

An example of population analyses using five nucleic acid molecules each traversing a 
nanochannel is described below. Five molecules representing a population of identical DNA 
20 fragments are prepared. In a constant electric field, the time of detection between the first and 
second labeled nucleotide should be identical for all the DNA molecules. Under experimental 
conditions, these times differ slightly, leading to a Gaussian distribution of times. The peak of the 
Gaussian distribution is characteristic of the distance of separation (d) between two labeled 
nucleotides. 

25 An additional example utilizing a population of one nucleotide randomly labeled nucleic 

acid molecule (six molecules represent the population) further illustrates the concept of population 
analysis and the determination of distance information. The nucleic acid is end-labeled to provide 
a reference point. With enough nucleic acid molecules, the distance between any two A's can be 
determined. Two molecules, when considered as a sub-population, convey the base separation 

30 between the end-label and the second A and also between the second and third A's. The times of 
separation are used to create distributions which are representative of base distance. In the same two 
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molecules, distributions of 4 and 6 base separations are created. Extending the same logic to rest 
of the population, the positions of all the A's on the DNA can be determined. The entire sequence 
is generated by repeating the process for the other three bases (C, G, and T). 

In addtion to labeling all of one type of unit, it is possible to use a one-nucleotide labeling 
5 scheme where not every nucleotide of one type is labeled. An outline of this method is shown in 
Figure 3 which shows a representative population of random A-labeled fragments for a 16-mer with 
the sequence 5'ACGTACGTACGTACGT'3. Each individually labeled DNA molecule has half of 
its A's labeled in addition to 5' and 3' end labels. With a large population of randomly labeled 
fragments, the distance between every successive A on the target DNA can be found. The end labels 

10 serve to identify the distance between the ends of the DNA and the first A. Repeating the same 
analysis for the other nucleotides generates the sequence of the 16-mer. The advantages of using 
such a method includes lack of steric effects and ease of labeling. This type of labeling is referred 
to as random labeling. A polymer which is "randomly labeled" is one in which fewer than all of a 
particular type of unit are labeled. It is unknown which units of a particular type of a randomly 

15 labeled polymer are labeled. 

As mentioned briefly above, various combinations of the labeling schemes can be used 
together. In all of the methods listed above, either ordered linear information from signals or 
distance information between nucleotides is considered. These two aspects can be combined to give 
methods that rely on both ordered linear and distance information. For example a random one 

20 nucleotide labeling strategy expanded to a random four nucleotide labeling strategy. Random four 
nucleotide labeling is where a fraction of all four nucleotides is labeled. A population of molecules 
have a fraction of the four nucleotides labeled. Each of the four nucleotides have a unique label. 
Analysis of a randomly labeled population results in generation of the sequence data. 

The use of double-stranded DNA allows for variations. A single-stranded two-nucleotidc 

25 labeling scheme can be performed when two of the nucleotides on one strand of DNA are fully 
replaced by labeled nucleotides. To reduce the steric constraints imposed by two extrinsically 
labeled nucleotides while preserving the theory behind two-nucleotide labeling, it is possible to label 
one nucleotide fully on each of the complementary strands to achieve the same end. This method 
involves using double-stranded DNA in which each strand is labeled with a different label. Six 

30 differently labeled duplex DNA sets will produce a data set which is adequate to provide sequence 
information. Each complementary strand of DNA should have one of the nucleotides labeled. In 
each of the duplex DNA sets, the equivalent of two different nucleotides (possible combinations are 
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AC, AG, AT, CG, CT, GT) are labeled. When both complementary strands have the adenines 
labeled, this is equivalent to the combination AT. In duplex two-nucleotide labeling, the advantage 
is that only one nucleotide on each strand is labeled, allowing longer labeled strands to be 
synthesized as compared to two-nucleotide labeling on single-stranded DNA. In practice, it has been 

5 shown that synthesis of DNA fragments with one nucleotide completely labeled can be achieved 
with lengths much greater than 10 kb (Ambrose et al., 1993; Harding and Keller, 1992). 

One use for the methods of the invention is to determine the sequence of units within a 
polymer. Identifying the sequence of units of a polymer, such as a nucleic acid, is an important step 
in understanding the function of the polymer and determining the role of the polymer in a 

10 physiological environment such as a cell or tissue. The sequencing methods currently in use are 
slow and cumbersome. The methods of the invention are much quicker and generate significantly 
more sequence data in a very short period of time. 

Sequencing of a polymer may encompass the sequencing of the entire polymer or portions 
of the polymer or even the identification of an individual unit on the polymer. One method for 

15 identifying an individual unit of a polymer involves the steps of transiently exposing the individual 
unit of the polymer, the identity of which is unknown, to an agent selected from the group consisting 
of electromagnetic radiation, a quenching source and a fluorescence excitation source to generate 
an interaction with a detectable signal characteristic of the individual unit, and detecting and 
distinguishing the signal from signals generated from adjacent signal generating units of the polymer 

20 as an indication of the identity of the individual unit. 

The individual unit is "transiently exposed" to the agent in order to produce a detectable 
signal characteristic of the individual unit. "Transiently exposed" as used herein means that the unit 
is positioned within interactive proximity of the agent for enough time to produce a signal and then 
is moved out of interactive proximity. The exact length of time required to produce a signal will 

25 depend on the individual unit and the agent involved but generally the amount of time is between 
one nanosecond and one second. 

The signal characteristic of the individual unit is distinguished from signals generated from 
adjacent signal generating units of the polymer. An "adjacent signal generating unit" is the unit 
nearest to the individual unit which when exposed to the agent produces a detectable signal. It is not 

30 necessarily the unit which is directly linked to the individual unit unless the unit which is directly 
linked is labeled (intrinsically or extrinsically) and produces a detectable signal. 

In the case when the agent is one or more fluorophores the interactive proximity between the 
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agent and the unit is the energy transfer proximity and the signal produced is fluorescence resonance 
energy, transfer. "Energy transfer proximity" as used herein is the distance between the unit and the 
fluorophore which allows interaction between two complementary sources if one source overlaps 
with the absorption spectrum of the other source. "Fluorescence resonance energy transfer" as used 
5 herein is the transfer of photonic energy between fluorophores with overlapping emission and 
absorption spectra. 

Another method for identifying an individual unit of a polymer involves the steps of 
transiently moving the individual unit of the polymer relative to a station, the identity of the 
individual unit being unknown, detecting a signal arising from a detectable physical change in the 
10 unit or the station, and distinguishing said signal from signals arising from exposure of adjacent 
signal generating units of the polymer to the station as an indication of the identity of the individual 
unit. 

Thus in one aspect, the methods of the invention can be used to identify one, some, or all of 
the units of the polymer. This is achieved by identifying the type of individual unit and its position 

1 5 on the backbone of the polymer by determining whether a signal detected at that particular position 
on the backbone is characteristic of the presence of a particular labeled unit. 

The methods of the invention also are useful for identifying other structural properties of 
polymers. The structural information obtained by analyzing a polymer according to the methods of 
the invention may include the identification of characteristic properties of the polymer which (in 

20 turn) allows, for example, for the identification of the presence of a polymer in a sample or a 
determination of the relatedness of polymers, identification of the size of the polymer, identification 
of the proximity or distance between two or more individual units of a polymer, identification of the 
order of two or more individual units within a polymer, and/or identification of the general 
composition of the units of the polymer. Such characteristics are useful for a variety of purposes 

25 such as determining the presence or absence of a particular polymer in a sample. For instance when 
the polymer is a nucleic acid the methods of the invention may be used to determine whether a 
particular genetic sequence is expressed in a cell or tissue. The presence or absence of a particular 
sequence can be established by determining whether any polymers within the sample express a 
characteristic pattern of individual units which is only found in the polymer of interest i.e., by 

30 comparing the detected signals to a known pattern of signals characteristic of a known polymer to 
determine the relatedness of the polymer being analyzed to the known polymer. The entire sequence 
of the polymer of interest does not need to be determined in order to establish the presence or 
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absence of the polymer in the sample. Similarly the methods may be useful for comparing the 
signals detected from one polymer to a pattern of signals from another polymer to determine the 
relatedness of the two polymers. 

The proximity of or distance between two individual units of a polymer may be determined 

5 according to the methods of the invention. It is important to be able to determine the proximity of 
or distance between two units for several reasons. Each unit of a polymer has a specific position 
along the backbone. The sequence of units serves as a blueprint for a known polymer. The distance 
between two or more units on an unknown polymer can be compared to the blueprint of a known 
polymer to determine whether they are related. Additionally the ability to determine the distance 

10 between two units is important for determining how many units, if any, are between the two units 
of interest. 

In order to determine the proximity of two individual units of a polymer of linked units the 
polymer is moved relative to a station, where each individual unit is exposed to the station to 
produce a characteristic signal arising from a detectable physical change in the unit or station. Each 

15 of the characteristic signals generated is then detected and the amount of time elapsed between 
detecting each characteristic signal is measured as described above. The amount of time elapsed is 
indicative of the proximity of the two individual units. The station may be an interaction station and 
the unit may be exposed to an agent to produce an electromagnetic signal. 

A "signal characteristic of an interaction" as used herein is a signal which is expected to 

20 result from the interaction of the station and a specific labeled unit. The specific signal generated 
will depend on the type of station as well as the type of labeled unit. For instance if the station is 
an agent which is electromagnetic radiation and the labeled unit is a fluorophore then the interaction 
between the two will result in the emission of electromagnetic radiation by the fluorophore at a 
wavelength at which the fluorophore is known to emit. If the station is an agent which is a 

25 scintillation layer and the unit is radioactive then the interaction between the two will result in the 
emission of electromagnetic radiation in the form of fluorescence. 

It is possible to determine the order of the units of a polymer using the methods of the 
invention. In one aspect of the invention the order of two individual units of a polymer can be 
determined by moving the polymer linearly with respect to a station and exposing two of the 

30 individual units to the station to produce first and second detectable signals arising from physical 
changes in the station or the unit. The order of the signals is an indication of the order of the two 
individual units. 



BNSDOCID- <WO 



9P35012A2 I > 



WO 98/35012 PCT/US98/03024 

-68- 

The general composition of the units of the polymer may also be determined by the methods 
of the invention. For instance, if the polymer is a nucleic acid the methods of the invention can 
provide information on the percentage of purines vs. pyrimidines or the percentage of A, C, T, and 
G in the nucleic acid. 

5 Quantitative information on the size of the sample may also be determined by the methods 

of the invention. For instance, the size of a polymer can be determined by determining the number 
of individual units which make up the polymer. The number of units which make up the polymer 
is determined by measuring the amount of time that is required for the entire polymer to traverse past 
an agent at an interaction site in a linear manner and dividing that by the average length of time for 

10 an individual unit of that particular type of polymer to completely traverse past the site. 

In addition to information about a specific unit the methods of the invention may be used to 
identify greater than one unit at a time in order to provide information about a polymer. In one 
aspect the method is carried out by providing a labeled polymer of linked units, detecting signals 
from labeled unit specific markers of less than all of the linked units, and storing a signature of the 

15 signals detected to analyze the polymer. In this aspect of the invention each unit of the labeled 
polymer may be labeled with a unit specific marker or less than all of the units may be labeled with 
a unit specific marker. 

This method is particularly useful for analyzing multiple units of a polymer at one time. 
This is accomplished by using a unit specific marker which is labeled and which interacts with more 

20 than one unit in a sequence specific manner. As used herein a "unit specific marker" is a compound 
which specifically interacts with one or more units of a polymer and is capable of identifying those 
units. For instance a unit specific marker for a nucleic acid molecule can be a labeled dimers, 
trimers, etc. which bind to a specific sequence of bases, such as TG, AG, ATC, etc. By identifying 
the presence or position of the labeled markers structural information about the polymer can be 

25 derived. For instance, the presence of the marker on a polymer can reveal the identity of the 
polymer. This enables the presence or absence of a polymer in a solution or mixture of polymers 
to be determined. The order, distance, number etc. of the markers on a polymer can provide 
information about the sequence or composition of a polymer. Other unit specific markers include 
but are not limited to sequence specific major and minor groove binders and intercallators, sequence 

30 specific DN A and peptide binding proteins, sequence specific peptide-nucleic acids, etc. Many such 
unit specific markers exist and are well known to those of skill in the art. 

This type of analysis can be used in one embodiment to identify DNA fragments by 
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analyzing the hybridization patterns of multiple probes to individual fragments of polymers. The 
current state-of-the-art methods for hybridization analysis of DNA rely upon DNA chips. The 
methods of the invention is advantageous for a number of reasons. The number, type, order, and 
distance between the multiple probes bound to an unknown fragment of DNA can be determined. 

5 This information can be used to identify the number of differentially expressed genes 
unambiguously. Current hybridization approaches can only determine the type of probes bound to 
a given fragment. Furthermore, the methods of the invention are able to quantitate precisely the 
actual number of particular expressed genes. Current methods rely on quantitation of fluorescence 
intensities, which often give rise to errors due to non-linearities in the detection system. Given the 

10 great amount of information generated, the methods of the invention do not require a selection of 
expressed genes or unknown nucleic acids to be assayed. This is in contrast to the requirement of 
different DNA chips for different genes, sets of expressed genes to be analyzed, and also different 
organisms. The methods of the invention can identify the unknown expressed genes by computer 
analysis of the hybridization patterns generated. The data obtained from linear analysis of the DNA 

1 5 probes are then matched with information in a database to determine the identity of the target DNA. 
The methods can thus analyze information from hybridization reactions, which can then be applied 
to Jugnostics and determination of gene expression patterns. 

A "signature" as used herein is a sequence-specific signal arising from a labeled polymer. 
The signature includes information about the structure of the polymer. For instance, the signature 

20 of a polymer may be defined by a series of consecutive units or by specific units spaced a particular 
distance apart from one another. The signature of the polymer identifies the polymer. Signatures 
arc useful for uniquely identifying fragments by identifying bases at certain positions along the 
length of a strand of DNA. The probability of knowing any one position is 1/4. Unambiguous 
identification of a fragment comes with roughly twenty positions identified (l/4 20 =9.1xl20" 13 ). For 

25 a fragment with 20 detected labels and 10% detection/labeling, the size of the fragment needs to be 
onl> 2()0 base pairs. The proposed read length is on the order of kilobases, which should 
unambiguously identify any fragment. The identification of fragments allows for grouping by 
similar sequences, making sequence reconstruction by population analysis possible. 

A preferred method of analysis of the invention involves the use of radioactively labeled 

30 polymers. The type of radioactive emission influences the type of detection device used. In general, 
there are three different types of nuclear emission including alpha, beta, and gamma radiation. 
Alpha emission cause extensive ionization in matter and permit individual counting by ionization 
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chambers and proportional counters, but more interestingly, alpha emission interacting with matter 
may also cause molecular excitation, which can result in fluorescence. The fluorescence is referred 
to as scintillation. Beta decay which is weaker than alpha decay can be amplified to generate an 
adequate signal. Gamma radiation arises from internal conversion of excitation energy. Scintillation 
5 counting of gamma rays is efficient and produces a strong signal. Sodium iodide crystals fluoresce 
with incident gamma radiation. 

A "scintillation" layer or material as used herein is any type of material which fluoresces or 
emits light in response to excitation by nuclear radiation. Scintillation materials are well known in 
the art. Aromatic hydrocarbons which have resonance structures are excellent scintillators. 
10 Anthracene and stilbene fall into the category of such compounds. Inorganic crystals are also known 
to fluoresce. In order for these compounds to luminesce, the inorganic crystals must have small 
amounts of impurities, which create energy levels between valence and conduction bands. 
Excitation and de-excitation can therefore occur. In many cases, the de-excitation can occur through 
phosphorescent photon emission, leading to a long lifetime of detection. Some common scintillators 
15 include Nal (Tl), ZnS (Ag), anthracene, stilbene, and plastic phosphors. 

Many methods of measuring nuclear radiation are known in the art and include devices such 
as cloud and bubble chamber devices, constant current ion chambers, pulse counters, gas counters 
(i.e., Geiger-Muller counters), solid state detectors (surface barrier detectors, lithium-drifted 
detectors, intrinsic germanium detectors), scintillation counters, Ccrenkov detectors, etc. 
20 Analysis of the radiolabeled polymers is identical to other means of generating polymer 

dependent impulses. For example, a sample with radiolabeled A's can be analyzed by the system 
to determine relative spacing of A's on a sample DNA. The time between detection of radiation 
signals is characteristic of the polymer analyzed. Analysis of four populations of labeled DNA (A's, 
C's, G's, T's) can yield the sequence of the polymer analyzed. The sequence of DNA can also be 
25 analyzed with a more complex scheme including analysis of a combination of dual labeled DNA and 
singly labeled DNA. Analysis of a A and C labeled fragment followed by analysis of a A labeled 
version of the same fragment yields knowledge of the positions of the A's and C's. The sequence 
is known if the procedure is repeated for the complementary strand. The system can further be used 
for analysis of polymer (polypeptide, RNA, carbohydrates, etc.), size, concentration, type, identity, 
30 presence, sequence and number. 

The methods described above can be performed on a single polymer or on more than one 
polymer in order to determine structural information about the polymer. The invention also 
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encompasses the practice of the methods described above on multiple polymers. These methods and 
an apparatus for performing the methods of the invention simultaneously on a plurality of polymers 
are described in detail below. 

Some of the methods described above are based on an interaction involving energy transfer 
5 or quenching to produce a detectable signal. The involvement of energy transfer or quenching is 
described in either one of two limitations in these embodiments of the invention. One limitation is 
that the agent involved in the interaction is selected from the group consisting of electromagnetic 
radiation, a quenching source or a fluorescence excitation source. The other limitation is that the 
detectable signal is an electromagnetic radiation signal. It should be apparent to one of ordinary skill 

10 in the art that each of the methods can encompass the other limitation instead of the one described 
and still encompass the notion of interaction involving energy transfer or quenching. For instance 
in addition to encompassing the method for analyzing a polymer by exposing the units of the 
polymer to the agent selected from the group consisting of electromagnetic radiation, a quenching 
source, and a fluorescence excitation source to produce a detectable signal, the invention in these 

15 embodiments also encompasses a method for analyzing a polymer by exposing the units of a 
polymer to an agent to produce a detectable electromagnetic signal. 

In addition to the methods involving energy transfer, quenching or electromagnetic radiation 
signals, the methods of the invention described above can be performed by detecting signals which 
arise from a detectable physical change in the unit of the polymer or the station. As used herein a 

20 "detectable physical change" in the unit of the polymer or the station is any type of change which 
occurs in the unit of the polymer or the station as a result of exposing the unit to the station. Once 
the unit is exposed to the station a detectable signal is created. The station may be for instance, an 
interaction station or a signal generation station, each of which is discussed in detail herein. The 
type of change that occurs in the station or the unit to produce the detectable signal depends on the 

25 type of station and the type of unit. Several examples of station-unit combinations which undergo 
a change to produce a detectable signal are discussed herein for exemplary purposes. Those of skill 
in the art will be able to derive other station-unit combinations that fall within the scope of the 
invention. 

When the interaction between the station and the unit of the polymer is based on energy 
30 transfer, either the unit or the station or both may physically change to produce a signal. In one 
embodiment the station may transfer energy to the unit causing the unit to emit an energy unit 
specific signal. The physical change which occurs in the unit results from the change in energy state. 
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In another embodiment the unit may transfer energy to the station causing the station to emit a unit- 
specific signal resulting from the specific energy transfer. In still other embodiments a partner 
compound may cause the physical change which produces a signal. When the interaction occurs 
between a radioactive unit and a station the unit physically changes by releasing energy. 
5 Another aspect of the invention encompasses methods for analyzing a plurality of polymers. 

Each of the polymers is analyzed by sequentially detecting interactions of units of a plurality of 
polymers at a signal generation station. These methods include but are not limited to a method for 
characterizing a test polymer, a method for sequencing a polymer, a method for determining the 
order of units of a polymer, a method for determining the distance between units of a polymer, and 
10 analyzing a set of polymers. These methods encompass but are not limited to interactions resulting 
from energy transfer or quenching. 

A method for characterizing a test polymer is performed by obtaining polymer dependent 
impulses for each of a plurality of polymers, comparing the polymer dependent impulses of the 
plurality of polymers, determining the relatedness of the polymers based upon similarities between 
15 the polymer dependent impulses of the polymers, and characterizing the test polymer based upon 
the polymer dependent impulses of related polymers. 

A "polymer dependent impulse" as used herein is a detectable physical quantity which 
transmits or conveys information about the structural characteristics of only a single unit of a 
polymer. The physical quantity may be in any form which is capable of being detected. For instance 
2() the physical quantity may be electromagnetic radiation, chemical conductance, electrical 
conductance, etc. The polymer dependent impulse may arise from energy transfer, quenching, 
changes in conductance, mechanical changes, resistance changes, or any other physical changes. 
Although the polymer dependent impulse is specific for a particular unit, a polymer having more 
than one of a particular labeled unit will have more than one identical polymer dependent impulse. 
25 Additionally, each unit of a specific type may give rise to different polymer dependent impulses if 
they have different labels. 

The method used for detecting the polymer dependent impulse depends on the type of 
physical quantity generated. For instance if the physical quantity is electromagnetic radiation then 
the polymer dependent impulse is optically detected. An "optically detectable" polymer dependent 
30 impulse ;ts used herein is a light based signal in the form of electromagnetic radiation which can be 
detected by light detecting imaging systems. When the physical quantity is chemical conductance 
then the polymer dependent impulse is chemically detected. A "chemically detected" polymer 
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dependent impulse is a signal in the form of a change in chemical concentration or charge such as 
an ion conductance which can be detected by standard means for measuring chemical conductance. 
If the physical quantity is an electrical signal then the polymer dependent impulse is in the form of 
a change in resistance or capacitance. 

5 As used herein the "relatedness of polymers" can be determined by identifying a 

characteristic pattern of a polymer which is unique to that polymer. For instance if the polymer is 
a nucleic acid then virtually any sequence of 10 contiguous nucleotides within the polymer would 
be a unique characteristic of that nucleic acid molecule. Any other nucleic acid molecule which 
displayed an identical sequence of 10 nucleotides would be a related polymer. 

10 A "plurality of polymers" is at least two polymers. Preferably a plurality of polymers is at 

least 50 polymers and more preferably at least 100 polymers. 

The polymer dependent impulses may provide any type of structural information about the 
polymer. For instance these signals may provide the entire or portions of the entire sequence of the 
polymer, the order of polymer dependent impulses, or the time of separation between polymer 

15 dependent impulses as an indication of the distance between the units. 

The polymer dependent impulses are obtained by interaction which occurs between the unit 
of the polymer and the environment at a signal generation station. A "signal generation station" as 
used herein is a station that is an area where the unit interacts with the environment to generate a 
polymer dependent impulse. In some aspects of the invention the polymer dependent impulse results 

20 from contact in a defined area with an agent selected from the group consisting of electromagnetic 
radiation, a quenching source, and a fluorescence excitation source which can interact with the unit 
to produce a detectable signal. In other aspects the polymer dependent impulse results from contact 
in a defined area with a chemical environment which is capable of undergoing specific changes in 
conductance in response to an interaction with a molecule. As a molecule with a specific structure 

25 interacts with the chemical environment a change in conductance occurs. The change which is 
specific for the particular structure may be a temporal change, e.g., the length of time required for 
the conductance to change may be indicative that the interaction involves a specific structure or a 
physical change. For instance, the change in intensity of the interaction may be indicative of an 
interaction with a specific structure. In other aspects the polymer dependent impulse results from 

30 changes in capacitance or resistance caused by the movement of the unit between microelectrodes 
or nanoelectrodes positioned adjacent to the polymer unit. For instance the signal generation station 
may include microelectrodes or nanoelectrodes positioned on opposite sides of the polymer unit. 
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The changes in resistance or conductance which occur as a result of the movement of the unit past 
the electrodes will be specific for the particular unit. 

A method for determining the distance between two individual units is also encompassed by 
the invention. In order to determine the distance between two individual units of a polymer of linked 
units the polymer is caused to pass linearly relative to an signal generation station and a polymer 
dependent impulse which is generated as each of the two individual units passes by the signal 
generation station is measured. Each of the steps is then repeated for a plurality of similar polymers. 
A polymer is said to pass linearly relative to a signal generation station when each unit of the 
polymer passes sequentially by the signal generation station. 

Each of the steps is repeated for a plurality of similar polymers to produce a data set. The 
distance between the two individual units can then be determined based upon the information 
obtained from the plurality of similar polymers by analyzing the data set. 

The method also includes a method for identifying a quantity of polymers including a label. 
For instance, it is possible to determine the number of polymers having a specific unit or 
combination of units in a sample. In a sample of mRNA, for example, the number of a particular 
mRNA present in the sample can be determined. This is accomplished by identifying a pattern or 
signature characteristic of the desired mRNA molecule. The sample of RNA can then be analyzed 
according to the methods of the invention and the number of mRNA molecules having the specific 
pattern or signature can be determined. 

As used herein "similar polymers" are polymers which have at least one overlapping region. 
Similar polymers may be a homogeneous population of polymers or a heterogenous population of 
polymers. A "homogeneous population" of polymers as used herein is a group of identical 
polymers. A "heterogenous population" of similar polymers is a group of similar polymers which 
are not identical but which include at least one overlapping region of identical units. An overlapping 
region typically consists of at least 1 0 contiguous nucleotides. In some cases an overlapping region 
consists of at least 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 22 contiguous nucleotides. 

A "plurality of similar polymers" is two or more similar polymers. Preferably a plurality of 
similar polymers is 50 or more similar polymers. More preferably a plurality of similar polymers 
is 100 or more similar polymers. 

A "data set" as used herein is a set of information defining the polymer dependent impulses 
generated by similar polymers. The data set is analyzed as discussed above and the method of 
analysis used depends on the type of labeling scheme used to generate the labeled polymers. 
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Nucleic acid sequencing is a particularly preferred embodiment of the methods of the invention. 

Currently, less than 5% of the human genome has been sequenced. This translates into a 
small fraction of the ideal in human sequence knowledge, which is the sequence of all individuals. 
For an instance, for the human population, there are 1 .4x1 0 19 (5 billion people x 3xl0 9 bases/person). 

5 So far, only 2x1 0" 10 percent of all human genetic information is known. The rate of sequencing of 
the human genome by all world-wide efforts is roughly 3xl0 9 /15 years, or 550,000 bases/day, at a 
cost of >$l/base. Sequencing by the methods of the invention described herein will constitute an 
inordinate breakthrough in the rate of sequencing. The predicted time to complete one human 
genome with one machine is —15 hours. Several dynamic arrays in parallel will be able to complete 

10 the sequence of one human genome in a fraction of an hour. 

A method for sequencing a polymer of linked units is also encompassed by the invention. 
The method is performed by obtaining polymer dependent impulses from each of a plurality of 
overlapping polymers, at least a portion of each of the polymers having a sequence of linked units 
identical to the other of the polymers, and comparing the polymer dependent impulses to obtain a 

15 sequence of linked units which is identical in the plurality of polymers. 

The plurality of overlapping polymers is a set of polymers in which each polymer has at least 
a portion of its sequence of linked units which is identical to the other polymers. The portion of 
sequence which is identical is referred to as the overlapping region and which includes at least ten 
contiguous units. 

20 In another aspect of the invention the order of units of a polymer of linked units can be 

determined by moving the polymer linearly relative to a signal generation station and measuring a 
polymer dependent impulse generated as each of two individual units, each giving rise to a 
characteristic polymer dependent impulse pass by the signal generation station. These steps are 
repeated for a plurality of similar polymers and the order of at least the two individual units is 

25 determined based upon the information obtained from the plurality of similar polymers. 

A method for analyzing a set of polymers, in which each of the polymers of the set is an 
individual polymer of linked units, is encompassed by the invention. The method involves the step 
of orienting the set of polymers parallel to one another, and detecting a polymer specific feature of 
the polymers. 

30 The set of polymers are oriented parallel to one another. The polymers may be oriented by 

any means which is capable of causing the polymers to be positioned parallel to one another. For 
instance an electric field may be applied to the polymers to cause them to be oriented in a parallel 
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form. Preferably the orientation step is in a solution free of gel. 

A "polymer specific feature" as used herein is any structural feature of polymer which relates 
to its sequence. For instance a polymer specific feature includes but is not limited to information 
about the polymer such as the length of the polymer, the order of linked units in the polymer, the 

5 distance between units of the polymer, the proximity of units in the polymer, the sequence of one, 
some or all of the units of the polymer, and the presence of the polymer. 

The step of detecting the polymer specific feature may be performed simultaneously for all 
of the polymers. This step encompasses the sequential detection of each of the units of all of the 
polymers. This can be accomplished by passing linearly each of the polymers relative to a plurality 

10 of signal generation stations, and detecting and distinguishing polymer dependent impulses 
generated as said polymers pass said signal generation stations. 

The invention also includes a method for analyzing a set of polymers, each polymer of the 
set being an individual polymer of linked units. The method is performed by orienting the set of 
polymers in an electric field, simultaneously moving the set of polymers through defined respective 

15 channels, and detecting a polymer specific feature as the polymers are moved through the channels. 
The step of simultaneously moving the set of polymers through respective channels is carried out 
by moving one polymer per channel such that each unit passes the station individually. More than 
one polymer may be in the channel at a time if the polymers are positioned in tandem and only one 
unit interacts with one station at a time. 

20 A "defined respective channel" as used herein is a channel in which the structure is 

determined before the polymer enters the channel such that the polymer will follow a defined path 
as it passes through the channel. Channels such as those found in a gel matrix are not defined 
respective channels. 

The methods of the invention may also be used to detect resonance energy transfer or 
25 quenching between two interactive partners capable of such transfer or quenching. As used herein 
resonance energy transfer (RET) is the transfer of photonic energy between two compounds with 
overlapping emission and absorption spectra. Fluorescence resonance energy transfer (FRET) is the 
transfer of photonic energy between fluorophores. The two interactive partners are any compounds 
which are capable of energy transfer or quenching ie., light emissive compounds or quenchers. 
30 The method is performed by bringing the two partners in close enough proximity to permit 

such transfer or quenching, applying an agent to one of said partners, the agent selected from the 
group consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
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source, shielding fluorescence resonance energy transfer and quenching occurring from 
electromagnetic radiation emission and interaction between the partners with a material shield, and 
detecting the emitted electromagnetic radiation. 

As used herein a "material shield" is any material which prevents or limits energy transfer 

5 or quenching. Such materials include but are not limited to conductive materials, high index 
materials, and light impermeable materials. In a preferred embodiment the material shield is a 
conductive material shield. As used herein a "conductive material shield" is a material which is at 
least conductive enough to prevent energy transfer between donor and acceptor sources. 

Each of the above methods of the invention are useful for at least various aspects of 

10 sequencing polymers. Also all of the methods can be used with the various labeling schemes 
described with respect to the method of analyzing polymers. 

The methods of the invention can be accomplished using any device which produces a 
specific detectable signal for an individual unit of a polymer. One type of device which enables this 
type of analysis is one which promotes linear transfer of a polymer past an interaction station or a 

15 signal generation station. According to one aspect of the invention, an article of manufacture which 
is useful for performing the methods of the invention is provided. The article of manufacture 
includes a wall material having a surface defining a channel, an agent selected from the group 
consisting of an electromagnetic radiation source, a quenching source, a luminescent film layer, and 
a fluorescence excitation source, attached to the wall material adjacent to the channel, wherein the 

20 agent is close enough to the channel and is present in an amount sufficient to detectably interact with 
a partner compound selected from the group consisting of a light emissive compound and a quencher 
passing through the channel. 

A "wall material" as used herein is a solid or semi-solid barrier of any dimensions which is 
capable of supporting at least one channel. A semi-solid material is a self supporting material and 

25 may be for instance a gel material such as a polyacrylamide gel. For instance the wall material may 
be composed of a single support material which may be conducting or non-conducting, light 
permeable or light impermeable, clear or unclear. In some instances the agent is embedded within 
the wall material. In these instances the wall material can be solely or partially made of a non- 
conducting layer, a light permeable layer or a clear layer to allow the agent to be exposed to the 

30 channel formed in the wall material to allow signal generation. When the wall material is only 
partially made from these materials the remaining wall material may be made from a conducting, 
light impermeable or unclear layer, which prevent signal generation. In some cases the wall material 
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is made up of layers of different materials. For instance, the wall material may be made of a single 
conducting layer and a single non-conducting layer. Alternatively the wall material may be made 
of a single non-conducting layer surrounded by two conducing layers. Multiple layers and various 
combinations of materials are encompassed by the wall material of the invention. 
5 As used herein a "luminescent film layer" is a film which is naturally luminescent or made 

luminescent by some means of excitation or illumination, e.g., electrooptic thin films and high index 
films illuminated by internal reflection. 

A "conductive material" as used herein is a material which is at least conductive enough to 
prevent energy transfer between a donor and an acceptor. 
I o A "nonconductive material" as used herein is a material which conducts less than that amount 

that would allow energy transfer between a donor and an acceptor. 

A "light permeable material" as used herein is a material which is permeable to light of a 
wavelength produced by the specific electromagnetic radiation, quenching source, or the 
fluorescence excitation source being used. 
15 A "light impermeable material" as used herein is a material which is impermeable to light 

of a wavelength produced by the specific electromagnetic radiation, quenching source, or the 
fluorescence excitation source being used. 

A "channel" as used herein is a passageway through a medium through which a polymer can 
pass. The channel can have any dimensions as long as a polymer is capable of passing through it. 
20 For instance the channel may be an unbranched straight cylindrical channel or it may be a branched 
network of interconnected winding channels. Preferably the channel is a straight nanochannel or a 
microchannel. A "nanochannel" as used herein is a channel having dimensions on the order of 
nanometers. The average diameter of a nanochannel is between 1 nm and 999 nm. A 
"microchannel" as used herein is a channel having dimensions on the order of micrometers. The 
25 average diameter of a microchannel is between Ifim and 1mm. Preferred specifications and 
dimensions of channels useful according to the invention are set forth in detail below. In a preferred 
embodiment, the channel is fixed in the wall. 

An agent is attached to the wall material in such a manner that it will detectably interact with 
a partner compound by undergoing energy transfer or quenching with the partner light emissive 
30 compound which is passing through the channel of the wall material. In order to interact with the 
partner compound the agent can be positioned in close proximity to the channel. For example, the 
agent may be attached to the inside of the channel, attached to the external surface of the wall 
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material, attached to a concentrated region of the external surface of the wall material surrounding 
the rim of the channel, embedded within the wall material, or embedded in the form of a concentric 
ring in the wall material surrounding the channel. Optionally the agent may cover the entire surface 
of the wall material or may be embedded throughout the entire wall material. In order to improve 

5 signal generation when the agent is not localized, a mask may be used to cover some areas of the 
wall material such that only localized regions of agent are exposed. A "mask" as used herein is an 
object which has openings of any size or shape. More than one agent may be attached to the wall 
material in order to produce different signals when the agents are exposed to the partner agent. 

The agent may be attached to the surface of the wall material by any means of performing 

10 attachment known in the art. Examples of methods for conjugating biomaterials are presented in 
Hermanson, G. T., Bioconjugate Techniques, Academic Press, Inc., San Diego, 1996, which is 
hereby incorporated by reference. 

When the agent is attached to the surface of the wall material it may be attached directly to 
the wall material or it may be attached via a linker. A "linker" as used herein with respect to the 

15 attachment of the agent is a molecule that tethers a light emitting compound or a quenching 
compound to the wall material. Linkers are well known in the art. Commonly used linkers include 
alkanes of various lengths. 

The agent is attached to the wall material in an amount sufficient to detectably interact with 
a partner light emissive compound. As used herein a "partner light emissive compound" is a light 

20 emissive compound as defined above but which specifically interacts with and undergoes energy 
transfer or quenching when positioned in close proximity to the agent. The amount of partner light 
emissive compound and the amount of agent required will depend on the type of agent and light 
emissive compound used. 

Another example of an article of manufacture of the invention is a wall material having a 

25 surface defining a plurality of channels and a station attached to a discrete region of the wall material 
adjacent to at least one of the channels, wherein the station is close enough to the channel and is 
present in an amount sufficient to cause a signal to arise from a detectable physical change in a 
polymer of linked units passing through the channel or in the station as the polymer is exposed to 
the station. A "discrete region" of the wall material adjacent to at least one of the channels is a local 

30 area which is surrounded by wall materiual not having a station. The area surrounding the discrete 
region does not interact with the unit to produce the same characteristic signal produced by the 
interactionof the unit with the station. The discrete region is positioned in or near the channel such 



BNSDOCID: <WO 



98350 12A2 I > 



WO 98/35012 PCT/US98/03024 

-80- 

that the station at the dictreet region is exposed to the unit as it traverses the channel. 

An additional article of manufacture of the invention is a wall material having a surface 
defining a channel and a plurality of stations each attached to a discrete region of the wall material 
adjacent to the channel, wherein the stations are close enough to the channel and are present in an 
5 amount sufficient to cause a signal to arise from a detectable physical change in a polymer of linked 
units passing through the channel or in the station as the polymer is exposed to the station. 

As used herein a "plurality of stations" is at least two stations. Preferably a plurality of 
stations is at least three stations. In another preferred embodiment a plurality of stations is at least 
five stations. 

10 In a preferred embodiment the article of manufacture is a nanochannel plate. The following 

description of an optimal design of a nanochannel plate having fluorophores embedded within the 
plate is provided for illustrative purposes only. The example describes methods for optimizing 
several aspects of the article of manufacture. The description is in no way limiting of the article of 
manufacture claimed herein. 

15 Several examples of nanochannel plates are presented in Figure 4. Figure 4 A shows a 

nanochannel plate (60) having layers of conducting material (62) and non-conducting material (64). 
The channel (70) has a diameter that is sufficient to encompass the passage of double-stranded, 
labeled DNA in a linear fashion. In this example donor fluorophores (68) are embedded in the clear 
non-conducting material in a concentric ring around each channel. The remaining portion of the 

20 nanochannel plate is made up of a light impermeable material (66). 

Figures 4B, C, D, E, and K show a nanochannel plate having fluorophores (68) attached to 
the surface of the wall material surrounding the opening produced by the channel (70). As shown 
in Figures 4B and 4D the fluorophores may cover the entire surface of the wall material. The 
fluorophores may also be concentrated around the channel opening as shown in Figures 4C, E r and 

25 K rather than covering the entire surface. Additionally, the wall material supporting the fluorophores 
may be a conducting layer (62) such as that shown in Figures 4D and E, the wall material may be 
a light impermeable layer (78) as shown in Figure 4K or the wall material may be a support layer 
(72) as shown in Figures 4B and C. A support layer may be any type of wall material including but 
not limited to conducting, non-conducting, clear, light permeable, and light impermeable, 

30 Figures 4F, G, H, I, and L show a nanochannel plate having fluorophores (68) embedded in 

the wall material surrounding the channel (70). Again the fluorophores may extend across the entire 
wall material (shown in Figure 4F) or may be concentrated around the channel as shown in Figures 
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4G, H, and I. In the embodiments shown in Figures 4F, G, H, I, and L the fluorophores are 
embedded in a layer of non-conducting material (62) or of a clear material (74) or of a light 
permeable material (79). The layer having the fluorophores embedded within it may form the 
surface of the wall material as shown in Figures 4F and G or may be sandwiched between other 
5 layers. For instance the non-conducting layer (64) in Figures 4H and I is sandwiched between two 
conducting layers (62). The light permeable layer (79) of Figure 4L is sandwiched between two 
light impermeable layers (78). In some cases the layers shown form the entire wall material. In 
other cases the layers maybe adjacent to or sandwiched between supporting layers as shown in 
Figure 41. 

10 Figure 4J shows a nanochannel plate having fluorophores (68) attached to the surface of the 

wall material surrounding the opening formed by the channel (70). The material surrounding all of 
the exposed surf aces of the wall material, including the surface within the channel is a conducting 
material. 

Figures 4M and N show a nanochannel plate having a luminescent thin film (76) positioned 
1 5 within the wall material surrounding at least a portion of the channel (70). The luminescent thin film 
either forms the surface of the wall material and is adjacent to a light impermeable layer (78) as 
shown in Figure 4M or may be sandwiched between two light impermeable layers (78) as shown in 
f igure 4N. 

Figures 40 and P show a nanochannel plate having two layers of fluorophores (68) either 
20 embedded in the wall material or attached to the surface of the wall material. In Figure 40 the 
lluorophore layers (68) arc attached to the surface of a conducting material (62) on either side of the 
wall material and surrounding the openings formed by the channel. In Figure 4P the fluorophore 
layers (68) arc embedded in two light permeable layers (79) which sandwich a light impermeable 
la\er(78). 

25 A preferred method of the invetnion involves the analysis of radiolabeled polymers as 

discussed above. Preparation of radiolabeled polymers such as DNA (for example, with 32 P or 3 H) 
is known in the art. The following description represents one of the many possible embodiments 
of analyzing radiolabeled polymers according to the methods of the invention (Figure 4Q and R). 
A radiolabeled nucleic acid molecule (160) is analyzed with a single fabricated multilayered 

30 nanochannel (162). The nanochannel is the diameter of the labeled nucleic acid molecule and is 
constructed to yield a defined region of detection. The examplary nanochannel plates shown in 
Figure 4Q and R are a heterogeneous multilayered structure consisting of two radiation impermeable 
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layers such as lead or Lucite films (164, 166) and a film of lower density material (or scintillation 
layer) (168) (i.e., conventional polymers, polymethylmethacrylate, polystryrene, Teflon, etc.). The 
lead films in Figure 4Q sandwich the layer of lower density material and are of such thickness as to 
prevent passage of radiation. The lower density material permits passage of radiation, thereby 

5 creating a defined region of radiation detection. As the radiolabel on the DNA passes through the 
defined region of detection, nuclear radiation is emitted, some of which will pass through the defined 
region of radiation detection. Figure 4R shows a nanochannel plate having low density material 
(168) surrounding the opening formed by the channel. The material surrounding all of the exposed 
surfaces of the wall material including the surface within the channel is a radiation impermeable 

10 layer (165). 

In a related embodiment of analysis of radiolabeled nucleotides (Figure 5), a detection system 
based on scintillation counting and multiple nanochannels is presented. A nanochannel array (1 70) 
is fabricated as shown in Figure 5. Multiple multilayered channels (172) exist for parallel 
amplification of data output. Each individual channel consists of two nuclear radiation shielding 
15 layers (174) which shield nuclear radiation, and a scintillation layer (176) which is fluorescently 
excited with exposure to nuclear radiation. The individual channels are separated from each other 
by a nuclear radiation shielding material. The nuclear radiation is prevented from reaching the 
fluorescent detection system by overlaying with optical quality Lucite (or any other transparent 
material which prevents the passage of nuclear radiation). This allows only the fluorescent signal 
20 to reach the detection system. 

Each of the above described nanochannels is only an example. It is, therefore, anticipated 
that each of the limitations described with respect to these embodiments involving any one element 
or combinations of elements can be included in each nanochannel. Preparation of films having 
multiple layers of differing material have been described in the art, e.g., US Patent No. 5,462,467, 
25 Ferreira et. al., Thin Solid Films 244:806-809 (1994). 

In the example provided donor fiuorophores are concentrated in a concentric ring around 
each channel in order to optimize the donor intensity. Intuitively, a concentric ring is preferred 
because the range of energy transfer is limited in part by a radial Forster distance. Examining one 
channel in detail will illustrate the marked changes in donor intensities that can occur for the 
30 concentric ring configuration. Figure 6 shows a single channel (82) having the concentric ring 
configuration. The outermost concentric ring (86) marks the edge of the donor fiuorophores situated 
around the channel. The middle concentric area (84) denotes the region where the rate of energy 
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transfer from the donor to the acceptor is greater than the rate of emission of the donor fluorophores. 

The rationale for embedding the donor fluorophores in a solid mediurii is best understood by 
examining the mechanisms of photobleaching and quenching. The factors that may diminish a 
fluorescent signal include: photobleaching, temperature quenching, energy transfer quenching, 
5 collisional quenching, excited state reactions, and solvent effects. These mechanisms are similar in 
that they all arise from either collisional or bimolecular events. A solid medium is a physical barrier 
to these undesired molecular processes and is a means of isolation of the donor fluorophores. For 
example, the mechanism of photobleaching is due to a reaction between an excited fluorophore and 
oxygen to form a non-fluorescent product (Menter et al., 1978; Menter ct al. 1979; Giloh and Sedat, 

10 1982). A solid medium, especially one manufactured in an oxygen-free environment, prevents 
oxygen from reaching the embedded fluorophores, eliminating possible oxidative reactions (Garland 
and Moore, 1979; Rost, 1990). In order to understand the effect of a solid medium further, a brief 
summary of the other quenching effects is presented. Temperature quenching is a decrease in 
fluorescence as temperature increases. The degree of temperature dependence depends on the 

15 compound; it is usually about 1% change in quenching per degree Celsius (Guilbault, 1973). The 
effect is believed to. be due to increased molecular motion and increased frequency of collisions, 
resulting in increased probability of transition to the ground state before fluorescence can occur. 
Collisional quenching is a broad category consisting of many possible mechanisms involving direct 
contact between a fluorophore and another molecule (Lakowicz, 1983). Excited state reactions 

20 involve the reactive excited state of a fluorophore in a reaction with nearby solvent molecules and 
constitutes a subset of collisional quenching (Porter, 1967; Zweig, 1973). Solvent effects consists 
of solvent-fluorophore collisions and interactions, including effects of hydrogen bonding, acid-base 
chemistry, and charge transfer interactions (Lakowicz, 1983). Energy transfer quenching is due to 
the effects of impurities on fluorophores via undesired FRET. From this short listing of 

25 photobleaching and quenching mechanisms, it is clear that molecular interactions with fluorophores 
are the main causes of diminished fluorophore emissions. A solid media thus eliminates these 
undesired molecular interactions, collisions, and reactions by isolating the fluorophores and creating 
a physical barrier that prevents the entry of undesired quenching molecules. 

The equation defining energy transfer should be examined in order to determine the effect 

30 of a solid medium on energy transfer. The equation that needs to be considered describes the Forster 
distance (Selvin, 1995; Cantor and Schimmel, 1980; Wu and Brand, 1994; Clegg et al., 1995; 
Fairclough and Cantor, 1978): 
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J is the normalized spectral overlap of the donor emission and acceptor absorption, § D is the quantum 
efficiency (or quantum yield) for donor emission in the absence of acceptor (<|) D is the number of 
photons emitted divided by number of photons absorbed), n is the index of refraction, and k 2 is the 
geometric factor related to the relative angle of the two transition dipoles. Embedding the donor 
fluorophore in a solid medium raises w, the index of refraction. In solvent systems, n is taken to be 
around 1 .35 (Selvin, 1 995), slightly higher than the value for water (1 .33). Some typical values for 
solid media are 1 .46 for fused quartz, 1 .52 for crown glass, and 1 .66 for dense flint glass. A ratio 
of the R f *s for a solid medium and a solvent system can be used to determine the magnitude of the 
effect of changing the index of refraction: 



K _ 

R o 



n 



2 3 



10 where the primed values are the ones for the solid medium and the unprimed ones are for a solvent 
system. Assuming n ' = 1 .5, the ratio becomes 0.93. The Forster distance thus only changes by 7%, 
a value that can be easily corrected for by using donor-acceptor pairs that have a higher original R (t . 

Referring again to Figure 6, conceptually, without presenting equations, the rate of energy 
transfer (A uanstcr ) from the center of the channel falls off with increasing distance from the center 

15 (Figure 6). It is also known that the rate of donor emission (k cmif ) is uniform over the complete area 
of donor fluorophores in the absence of acceptors (Figure 6). The area closest to the channel, and 
thus to the acceptor, is quenched completely. At all areas less than R equa |, the donor fluorophore is 
quenched completely because £ transfer is greater than k emit . At areas greater than the R equah the rate of 
donor emission is greater than the rate of energy transfer so quenching is incomplete. 

20 If the concentric ring of donor fluorophores has a radius equal to R equa! , then a hundred 

percent intensity change can occur for the donor fluorophores. This means that upon the exit of an 
acceptor labeled nucleotide through the channel, the acceptor is detected with perfect efficiency. 
Recalling that confidence intervals are related to the SNR of the system, the minimum signal to noise 
ratio needed to generate a 99.9% confidence interval for such a change is calculated to be 6:1 . The 
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SNRs for the detection systems proposed are greater than 600: 1 . 

A quantitative explanation is presented at this time to calculate Re qua |. The equations for the 
rates of donor emission and energy transfer are presented. The rate of donor emission (k cwh ) is as 
follows: 

k - i£fi£ 

-* ~~ hv 

5 where / is the intensity of light incident upon the donor fluorophores, e is the molar extinction 
coefficient of the donor fluorophore, p is constant for fluorescence emission (3.8 x 10 21 mol cm 3 /L), 
N is the number of donor fluorophores, h is Planck's constant (6.6261 x 10 34 J s). and o is the 
frequency of excitation light. 

The rate of energy transfer (k tramfcr ) from many donors to one acceptors is derived from the 

10 original Forster rate equation (Forster, 1965) for one donor and one acceptor. The original equation 
is given as: 



( \ 

R 



r J 



where k is the rate of energy transfer from one donor to one acceptor, R (t is the Forster distance, x 
is die fluorescence lifetime of the donor, and r is the distance from the donor to the acceptor. The 
derivation of a multi-donor system is straightforward and follows from the discussion of the multi- 

1 5 donor system described above. 

There are two limitations in the amount of energy transferred for a multi-donor system. First, 
ihere is the saturation limit imposed by the lifetime of the acceptor. The acceptor is only able to be 
excited whenever it is in the ground singlet state. For an acceptor with a lifetime of 1 ns, the upper 
limit is 1 x 10 9 excitation events/second. This large saturation level is hardly a concern given the 

20 low rate of excitation for. a single fluorophore (25,000 excitation events/second). The second 
limitation is the probability of simultaneous de-excitation of donor fluorophores. As calculated 
above only a very small number of simultaneous de-excitation events can occur. Since the acceptor 
is not saturated with excitation events resulting from energy transfer and the probability of 
simultaneous donor de-excitation is small, the rate of energy transfer for a multi-donor system is 
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directly proportional to the number of donor fluorophores (N) : 



transfer 



N_ 



O 

\ r ) 



With this equation, the radius at which k lnmsJcr is equal to k emit (R^) is found by equating the 
two rate equations and solving for r. 



^ emit ^transfer 



hv 
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5 Solve for r. 



r = R 



equal 



R 



A numerical value for R cqua i can be calculated. Table 3 lists the values for the variables and the 



reason the particular value is chosen. 



Tabic 3 


variable 


value 


reason 


R 0 


60A 


range for Forster transfer is 20 A - 100A 




1 x 1 0" 9 s/photon 


fluorescent lifetimes range from 1 ns - 20 ns 


h 


6.6261 X 10' 34 Js 


Planck's constant 


V 


6.1224x10 14 xs' 1 


c=vA; A=490nm (excitation of fluorescein) 


1 


30 W/cm 2 


intensity of 2 W laser is given by P/A. A=beam area (r=2mm). 
Intensity of laser is 64 W/cm 2 


8 


9l f 0001/M cm 


molar extinction coefficient for fluorescein 


P 


3.8 x 10-21 M cm 3 


constant for fluorescence emission of fluorescein 


Rfnnai 


350A 


from above equation 



Re qua , is calculated to be 350 A, within an order of magnitude of the Forster distance. This 
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means that a concentric ring of fluorophore around a channel with a radius equal to 350 A will give 
rise to a hundred percent change in signal intensity upon the passage of an acceptor label. In 
practice, having such a large donor intensity decrease is unnecessary from both the standpoint of 
signal detection and the need to resolve adjacent bases. For example, with a SNR of 600: 1 , only a 

5 0.50% intensity change will give rise to a 99.9% confidence interval. Resolution between adjacent 
bases is possible by looking at further decreases in the donor emission when there are two acceptors 
instead of one in the donor layer. If one acceptor already decreases the donor emission to zero, then 
an additional acceptor will not be detected because the donor emission cannot decrease further. This 
aspect is discussed in detail below. An example of a method for constructing a concentric ring of 

1 0 donor fluorophores around each channel by using a photolabile protecting group and light diffraction 
is also provided below. 

Nanochannels having a channel diameter size of at least 1 nm are commercially obtainable 
in the form of polycarbonate filters from Poretics, Inc. and can be made on order by Corning 
Separations, Inc. There are several methods that can be used to create nanochannels of the desired 

15 diameter. 

One method for preparing a nanochannel plate is by a track-etch procedure which produces 
cylindrical pores of uniform diameter in a membrane material. Microporous and nanoporous 
polymeric membranes having pore diameters on the order of 10 nm and with pore densities 
approaching 10 9 pores per square centimeter can be prepared by the track-etch method (R.L. 

20 Fleischer, P.B. Price, R.M. Walker, Nuclear Tracks in Solids (Univ. of California Press, Berkeley, 
CA (1975)). The manufacture of pores via track-etch is a two step process. In the first step, thin 
polycarbonate (or other polymeric material) film is exposed to collimated, charged particles in a 
nuclear reactor. As these particles pass through the polycarbonate material, they leave sensitized 
tracks. The density of the tracks is controlled by varying the amount of time the film is in the 

25 reactor. In the second step, the tracks left by the particles are preferentially etched, or dissolved, into 
uniform, cylindrical channels. The diameters of the perforations can be controlled by the residence 
time of the etchant on the film. Many examples of methods for forming track etched membranes 
have been described in the art, e.g. European patent Application No. 83305268.1, Publication No. 
0109147, to Varian Associates, Inc., U.S. Patents Nos. 3,303,085; 3,662,178; 3,713,921; 3,802,972; 

30 3,852,134, 4,956,219, 5,462,467, 5,564,959 and 5,449,917, each of which is incorporated herein by 
reference. 

The commercially available membranes are generally prepared from polycarbonates or 
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polyesters; however, a number of other materials are amenable to the track-etch process (Id.). For 
instance, other polymeric materials include but are not limited to polystyrenes, aromatic polyesters, 
polyolefins, including polyethylene, polyethylene terephthalate, polypropylene, vinyl plastics such 
as polyvinyl difluoride (PVDF), and cellulose esters such as cellulose nitrate, cellulose butyrate and 
cellulose acetate. If the nanochannel plate of the invention is prepared by a track-etch technique it 
may be formed from any material capable of being track etched can be used to form the track etched 
membrane. 

Devices for performing bombardment of materials with high energy particles are well known 
in research and industry. The particles used to form the tracks may be generated by a charged 
particle accelerator, such as an electrostatic accelerator (e.g., a Van de Graaff accelerator or Tandem 
accelerator), a linear accelerator or a cyclic accelerator such as a cyclotron or any other means known 
in the art. 

Once the damaged track is formed in the film the channels or pores are formed by selectively 

etching the film with a gas or liquid. The residence time of the etchant determines the size of the 

5 channels. The track etched film is exposed to the etchant for sufficient time to generate channels that 

are sized to match the desired application, which varies depending on the type of polymer being 

analyzed and the type of analysis. The channel diameter can be measured using a scanning electron 

microscopy (SEM) according to methods disclosed in Basic Principles of Membrane Technology, 

M. Mulder, Klumer Academic, 1991. 

o A second method of creating nanochannels of defined diameter is to use a combination of 

track-etching and surface coating. A polycarbonate membrane of a diameter greater than the desired 

nanochannel device is coated with a thin film of material with a defined thickness. The resulting 

structure is a polycarbonate membrane surface coated tp the desired diameter. The first layer of thin 

film that is added to the nanochannel plate is a conducting layer. A conducting layer helps to resolve 

!5 adjacent bases (which is discussed below in more detail). 

Thin layers of conducting polymers are added to the polycarbonate membrane through 

solvent deposition. Solvent deposition of conducting polymers have been described (Cheung et al., 

1994; Fereira et al., 1994; Fereira and Rubner, 1995; Fou and Rubner, 1995). The following is 

excerpted from Fou and Rubner, 1 995 : 

30 We describe the solution chemistry and methodologies needed to utilize the layer-by-layer 

processing technique described... to manipulate conducting polymers such as polypyrrole and 
polyaniline into multilayer thin films with angstrom-level control over both film thickness 
and film architecture. Ultrathin films with conductivities over 300 S/cm can be made. 
The process involves the spontaneous adsorption of monolayers of electrically conductive 



'l " t 

WO 98/35012 PCTYUS98/03024 

. -89- 

polymers onto substrates from dilute solutions. Subsequent multilayer thin films are created 
by alternate deposition with soluble polyanions. The thickness of the thin films can be 
precisely controlled to the angstrom level and can range between 5 A and greater than 1000 
A. The conductive polymers used are polypyrrole and polyaniline because these can be made 
5 extremely conductive (300 S/cm) for ultrathin layers (-50 A). 

The advantage of using a solvent deposition method to create the desired nanochannels is 
two-fold. First, nanochannels of any particular diameter can be created with precision and accuracy. 
Second, since the first layer of the nanochannel plate is a conducting layer, it is only convenient to 
10 add it. An additional advantage, though not significant, is that addition of a thick layer of 
conducting material can lower the peak-to-valley distance of the polycarbonate surface, creating a 
more uniform surface. 

Another method for preparing a nanochannel device of the invention is through the 
production of arrays of carbon nanotubes. Iijima demonstrated the production of multiple concentric 
15 cylindrical shells of hexagonally bonded carbon atoms which can serve as catalytic surfaces to 
confine species in a 1 -dimensional space. Iijima, Nature, 354:56 (1991), see also US Patent No. 
4,424,054. 

Li, W. Z., et. al. 3 has also reported a method for producing large areas of highly ordered, 

isolated long carbon nanotubes. The method is based on a chemical vapor deposition which utilizes 

20 mesoporous silica containing iron nanoparticles embedded in the pores rather than carbon black and 

graphite or silica covered with transition metal nanoparticles. The following method is disclosed 

in Li, W. Z. etal.: 

Mesoporous silica containing iron nanoparticles were prepared by 
sol-gel process from tctraethoxysilane (TEOS) hydrolysis in iron nitrate 
aqueous solution. Analytically pure TEOS (10 ml) was mixed with 10.4 
ml of analytically pure ethyl alcohol and 0.1 M iron nitrate aqueous 
solution (1 1.4 ml) by magnetic stirring for ~ 30 min. A few drops of 
concentrated hydrogen fluoride (0.2 ml) were then added, and the mixture 
was stirred for 15 min. After gelation of the mixture, the gel was dried 
for 1 week at 60 °C to remove the excess water and other solvents. The 
gel was then calcined 10 hours at 450°C at 10' 2 ton*. A silica network 
with relatively uniform pores was obtained with iron oxide nanoparticles 
embedded in the pores. The iron oxide nanoparticles were then reduced 
at 550°C in 180 ton* of flowing 9% H 2 /N 2 (1 10 cmVmin) for 5 hours to 
obtain iron nanoparticles, which have a high catalytic activity. 
Subsequently, a mixture of 9% acetylene in nitrogen was introduced into 
the chamber at a flow rate of 1 10 cmVmin, and carbon nanotubes were 
formed on the substrate by deposition of carbon atoms obtained from 
decomposition of acetylene at 700 °C. The samples were examined by a 
scanning electron microscope (SEM) (S-4200, Hitachi) before and after 
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carbon deposition, and energy-dispersive x-ray spectra (EDX) were 
recorded by a SiLi detector attached to the SEM. To prepare a 
transmission electron microscope (TEM) specimen, the sample was 
ground in a mortar and suspended in ethanol; a drop was then placed on 
5 a holey carbon copper grid and examined in a JEM 200-cx microscope to 

characterize the carbon nanotubes. 

Additionally, nanochannels may be prepared from anodic porous alumina which is a packed 
array of columnar hexagonal cells with central, cylindrical, uniformly sized holes typically ranging 
10 from 4 to 200 nm in diameter. Membranes of this type are prepared electrochemically from 
Aluminum metal (A. Despic and V.P. Parkhutik, in Modern Aspects of Electrochemistry, J.O. 
Bockris, R.E. White, B.E. Conway, Eds. (Plenum, New York, 1989), vol. 20, chap. 6.). Pore 
densities as high as 10 n pores per square centimeter have been achieved (D. AlMawiawi, N. 
Coombs, M. Moskovits, J. Appl Phys. 70, 4421 (1991)). Membranes having pore diameters as 
15 small as 5 nm have been prepared using this method (and it is believed that even smaller pores can 
be prepared). Martin, C.R., Science, 266:1961 (1994). 

Matsuda and Fukuda have described a modification of porous alumina membranes. The 
membranes which are a highly ordered metal (platinum and gold) nanohole array are prepared using 
a two-step replication of the honeycomb structure of anodic porous alumina. Preparation of the 
20 negative porous structure of porous alumina followed by the formation of the positive structure with 
metal results in a geometrical structure identical to that of anodic porous alumina. The method, 
therefore allows the preparation of the hole array of anodic porous alumina with desired materials 
other than. alumina. Matsuda and Fukuda, Science, 268:1466 (1995). Matsuda and Fukuda's 
procedure is summarized below. 
25 Anodic alumina was produced using a long period anodization of alumina on a substrate at 

40 V which results in a minimum number of defects and dislocations, followed by removal of the 
aluminum substrate and the bottom part of the porous layer with saturated HgCl 2 . The material was 
then dipped in 5% (by weight) phosphoric acid solution at 30°C to adjust the pore diameter. A thin 
metal layer was deposited on the bottom of anodic porous alumina by vacuum deposition in order 
30 to create a catalyst or electrode for the subsequent metal-plating process. Generally the same metal 
as that used to create the scaffold is used for the evaporation. Metal methacrylate monomer 
containing a polymerization initiator such as 5% (by weight) benzoyl peroxide was injected into the 
holes under vacuum conditions and was polymerized by ultraviolet irradiation. The alumina layer 
was then removed with NaOH to produce a negative porous alumina-type structure of poly(methyl 
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methacrylate) (PMMA). A positive structure was formed from the PMMA mold by electroless 
deposition of platinum as follows. The negative type of PMMA was dipped into the electroless 
plating solution, causing metal deposition to start at the bottom part of the cylindrical structure and 
the metal to gradually fill the cavity of the PMMA to the top of the negative type of PMMA. 
5 Alternatively a gold (Au) hole array was prepared using electrochemical deposition of Au into the 
microcavity of the PMMA under constant conditions. The porous metal was obtained by dissolving 
the PMMA with acetone. 

Other nanoporous materials which have been described in the art include a nanochannel array 
glass with pore densities as high as 3 X 10'° pores per square centimeter (R.J. Tonucci, B. L. Justus, 
10 A.J. Campillo, C.E. Ford, Science 258, 783 (1992) and Pearson and Tonucci, Science, 270: 68 
(1995)). Douglas et al. have shown that the nanoscopic pores in a protein derived from a bacterium 
can be used to transfer an image of these pores to an underlying substrate (Douglas, et. Al., Science 
257: 642 (1992)). Finally, Ozin has discussed a wide variety of nanoporous solids that can be used 
as template materials (Ozin, G. Adv. Mater. 4:612 (1992)). Nishizawa et. al., describe the production 
15 of metal nanotubulc membranes having radii as small as 0.8 nm (Nishizawa et. al., Science 268:700 
(1995), describing nanotubules formed by plating gold onto the walls of pores in a commercially 
available polycarbonate filtration membrane (Poretics) containing cylindrical nanopores of uniform 
radius (25 nm, 6 X 10 8 pores per square centimeter) running through the complete thickness (6 jim) 
of the membrane. "The thickness of the Au layers deposited on the pore walls can be controlled by 
20 varying the plating time. As a result, the inside radius of the Au nanotubules can be varied at will 
[as determined from measurements of gas (He) flux across the tubule-containing membrane.]" 
Berggren et. al., have demonstrated techniques for nanolithography using self-assembled monolayers 
and a beam of neutral inert gas. Berggren et. al., Science 269: 1255-1257 (1995). 

La Silva et al., describe a technique for fabricating simple metal structures with a scanning 
25 tunneling microscope (STM) which have dimensions of 1 0's to 1 00's of nanometers and are partially 
electrically isolated from their environment. The method is performed by depositing a very thin 
metal film on an insulating substrate, and using the tip to machine gaps through the film where 
lateral electrical insulation is desired. (Journal of Vacuum Science & Technology B, (1993) 
11:1992-1999) 

30 The wall material may be constructed in a manner which is optimal for resolving adjacent 

units of the polymer. Since the purpose of the wall material is to provide an environment which is 
conducive to generating a signal, the materials used to prepare the wall material may be selected to 
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aid in this process. For instance, the wall material surrounding the agent if the agent is embedded 
in the wall material preferably is a non-conducting or light permeable material. At least two other 
mechanisms for optimizing the wall material in an apparatus having fluorophores embedded in the 
wall surrounding the channel, in order to resolve adjacent labeled bases include the use of thin 
5 conducting layers and controlling the radii of donor fluorophores around each channel. 

Conducting layers prevent Forster energy transfer through electrical shielding. Figure 6 
demonstrates a configuration of the conducting layers relative to the non-conducting layer which 
contain the donor fluorophores. The conducting "sandwich" creates a defined region where energy 
transfer can occur and this helps to optimize base resolution. Forster energy transfer arises because 
10 of an electrical dipole-dipole interaction between a donor and acceptor. The rationale for the use of 
conducting layers when the signal generated is dependnet on FRET is that FRET is electric field- 
dependent and thus placing an electric shield between the donor and acceptor will prevent energy 
transfer. It is necessary to understand fully the basis of energy transfer. Incident excitation light 
creates an electric field in the donor because the light induces transitions in the donor, or causes 
15 electrons in the donor to oscillate (Selvin, 1995). This creates an induced, electric dipole moment 
in the donor, which in turn, creates its own electric field. Energy transfer arises when an acceptor 
is placed in the donor's electric field. As a consequence, there are induced transitions in the acceptor 
which create an induced dipole moment, p A . The size of the dipole moment is related to the size of 
the donor electric field: p A =a A E 0 , where cc A is the polarizability of the acceptor. The amount of 
20 energy absorbed by the acceptor is p A E jy> = a A E t) \ which translates into the 1/R 6 dependence of the 
rate of energy transfer. 

The theory of electrical shielding is found in most introductory physics books. Purcell, 1985 
provides a clear explanation. The potential function inside the box, i|' (*> >'> *)» must satisfy 
Laplace's equation, T 2 \|/ = 0. Given the knowledge of conductors, the boundary of the conductor 
25 is an equipotential, meaning that = y 0 a constant function everywhere on the surface of the 
conductor. An obvious solution to Laplace's equation is \y = \\? tl throughout the volume. According 
to the uniqueness theorem, there can only be one solution, meaning the answer is \\i = a constant. 
The electric field of a constant potential function is zero because E = -grad V)/. The electric field is 
thus zero everywhere inside the box. 

30 The model of a box with an open hole can be applied to a nanochannel plate because 

essentially a nanochannel plate is a conducting box with many "holes". Consider the Figure 7A and 
7B. Figure 7 A shows a complete nanochannel plate having nanchannels (90). The layer of donor 
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fluorophores (88) is enclosed -by the conducting *Mi*<3x;" In this manner, tliere cannot be energy 
transfer to the donor fluorophores from outside the box because of electrical shielding (92). Recall 
that the basis of energy transfer is electric. At the boundary condition of the nanochannel, which is 
the outer surface of the nanochannel plate, \j/ = an equipotential surface. A logical solution to 

5 Laplace's equation is that everywhere inside the boundary condition that , \|/= constant. From the 
uniqueness theorem, this is the only possible answer and thus everywhere inside the conducting 
nanochannel "box," E = 0 because E =y-grad . The conclusion is that acceptor fluorophores cannot 
undergo energy transfer from outside the box. Even inside a nanochannel, as shown in Figure 7B, 
energy transfer is limited geometrically. At any position where the acceptor molecule is not in the 

10 plane of the donor fluorophores, the amount of energy transfer is limited. In Figure 7B, the amount 
of donor fluorophores that cannot undergo energy transfer is enclosed by an angle of 0, which for 
all purposes is very large unless the acceptor fluorophore is directly in the plane of the donor 
fluorophores, which is the desired configuration. 

The consequence of the conductive layers is that a sharp signal is created upon the passage 
15 of a labeled nucleotide through the nanochannel. Figure 8A and Figure 8B demonstrate signal 
generation upon passage of an acceptor label through the nanochannel. Figure 8A shows an enlarged 
view of one nanochannel (98). Only part of the conducting layers (94) is shown. The light 
impermeable polycarbonate layer is not shown. An acceptor label (100) on a strand of DNA moves 
through the nanochannel from bottom to top, starting at position A and moving to position C. FRET 
20 can only occur at position B because the conducting layers shield any interaction of the donor 
fluorophores (%) with the acceptor at positions A and C. Figure 8B graphically illustrates the sharp 
changes in donor intensity as an acceptor label moves from positions A to C. The middle region on 
the graph denotes the level of the donor fluorophores. The outer regions on the graph denote the 
level of the conducting layer. At the interface of the conducting and donor fluorophore layers, there 
25 arc dramatic changes in donor intensity due to electrical shielding. 

It is clear that resolution between adjacent bases can be resolved with conducting layers. By 
creating a conducting "sandwich" where the thickness the donor fluorophores is less than the helical 
rise of B-DNA (3.4 A), the desired resolution can be achieved. Thin films of this thickness can be 
constructed easily with plasma, solution, chemical vapor, or ion beam deposition methods (Spohr, 
30 1900; Valiev. 1992; Konuma, 1992; Pauleau, 1995; Bruno et al., 1995; Dash, 1975; Stuart, 1983; 
Morosanu, 1990) However, it is not desirable to use a donor fluorophore film less than 3.4 A, as will 
be explained under the next heading. In brief, donor fluorophores embedded in a thicker layer 
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allows the measurement of instantaneous rate of movement of DNA. By measuring the time a 
labeled nucleotide spends in the thick layer and knowing the dimensions of the layer, the rate of 
DNA movement is known, which is important for determining distances between labeled 
nucleotides. 

5 Another example of a method for precisely resolving adjacent labeled bases is to control the 

radii of the donor fluorophores around each nanochannel. The amount of energy transferred for two 
acceptor labels in the presence of a concentric ring of donor fluorophores is greater than the energy 
transferred for one acceptor label. Detection of the difference in energy transferred for one and two 
acceptors allows the resolution of adjacent bases. In order for the donor fluorophores to be able to 

10 interact with more than one acceptor, the thickness of the donor fluorophores has to be greater than 
the helical pitch of DNA. Furthermore, the radii of the donor fluorophores must be greater than R^, 
for one acceptor (see Figure 6). A radii greater than R cqua , for one acceptor allows for further 
decreases in donor intensity in the presence of more than one acceptor. A radii at R^, means that 
upon that passage of one acceptor, the donor intensity decrease is equal to 100%. In this case, 

15 passage of two adjacent acceptors gives the same detected signal as one acceptor. 

Figure 9 schematically demonstrates the passage of a two-base labeled strand of DNA (102) 
through a nanochannel (104) with the proper thickness and radii of donor fluorophores (108) 
sandwiched between conducting material (106). The positions labeled "A" through "D" correspond 
to the labels on the graph shown at the right of the illustrations. Initially, the acceptor labels on the 
20 DNA are at position A. Energy transfer is not possible at this position so donor intensity remains 
at a maximum. Further movement of the DNA allows one acceptor to undergo energy transfer with 
the donor fluorophores (B) and a sharp decrease in the donor intensity occurs. At position C, two 
fluorophores can undergo energy transfer yielding a further decrease in the donor intensity. Finally, 
the two acceptor labels exit the region of donor fluorophores, energy transfer is not longer possible, 
25 and the donor intensity returns to a maximum (D). 

The change in donor emission in the presence of one and two acceptors can be visually 
demonstrated without mathematical quantitation. Figure 10A illustrates the amount of change as 
volumes. The illustrations show the amount of energy transfer as a solid volume (1 10). The original 
donor intensity is represented as the volume in the shape of a disc (112). Figure 10B shows the 
30 change for one acceptor. The right illustration is for two acceptors. The decay curves represent the 
rate of energy transfer with respect to radial distance as given by Forster's equation. The decay 
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curve for two acceptors is roughly double that of one acceptor. The rates of donor emission and 
energy transfer are expressed in units of 1/s run 2 . Integration of the rate of donor emission decrease 
over the surface area of energy transfer yields the net decrease in donor emission. This is 
represented as the shaded area under each decay curve integrated over 2 tc. Knowing the original 
5 donor emission (shaded rectangular area integrated over 2 7t), the percent decrease in donor emission 
can be found. 

Mathematically, the changes can be easily calculated. In both cases, the change is equal to 
the striped area (Figure 1 0) integrated over 2 n. To calculate, the rates of donor emission and energy 
transfer have to be expressed in the appropriate units of 1/s nm 2 . To do so, the density of the donor 
10 fluorophores, given as N/A, is to be used in the rate equations, where N n is the number of 
fluorophores and A is the area which is occupied by the donor fluorophores. Accordingly, the rate 
of emission is given as: 

"emit 



hvA 



The rate of energy transfer becomes: 



transfer 



< R ^ 



Where N A is the number of acceptors that can undergo energy transfer with the donors. The 
15 general equation for the striped areas integrated over 2 n follows: 



For the present calculations, R^, = 55 nm, N D =1000. The donor fluorophore density (N L /A) 
becomes 0.1 1 fluorophores/nnr. This value is not unreasonable because the size of the area of the 
largest possible fluorophore is 1 nm 2 . This means that the density is at least an order of magnitude 
lower than the highest possible fluorophore density. 
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In order to solve the general equation, recall that can be solved by setting k,, mit = k trans fc r , 
resulting in the following: 



R = ±_ 

equal 6 



^ D lZp/N A hv 

It is also important to know the original donor emission (E {) ), given as: 



E G " k emii 11 ^ donor R channel) 

Using the values from table 3 and from above, it is possible to tabulate donor emission values 
tor different numbers of acceptors that are in the position to undergo energy transfer (table 4). 
Values for =1-5, 1 1-12 are calculated as examples. It is expected that the value of the first donor 
emission decrease be the greatest. Subsequent decreases become progressively smaller. When R^, 
approaches R^ onon it is expected that actual (not percent) change approaches zero. This is so because 
essentially the donor molecules are almost completely quenched. 

The percent changes and the signal-to-noise ratios determine the detection capability, not the 
absolute numerical changes in donor emission. As expected, the SNR decreases as the number of 
acceptors increase because there is greater donor quenching. Otherwise stated, the donor emission 
becomes smaller. This decrease in SNR is compensated by increasing percent changes from N A = 
5 to 1 2. The confidence is calculated by using the SNR and the percent change. For example, the 
confidence for detecting the change from one to two acceptors uses the SNR for one acceptor and 
the percent change from one to two acceptors. In this case, there is a 95% confidence for detecting 
a 0.4X3% change. Since the percent change is high, 29.1%, the signal change from one to two 
acceptors is detected with a 100% confidence. The calculations from above demonstrate that 
multiple adjacent acceptors can be detected with high efficiency. 



|~ Table 4 




1 N * 


donor emission (photons/s) 


% change 
from previous 


SNR (80% full well 
capacity at E 0 , N A =0 


confidence 


1 0 


2.5572 x 10 7 




632:1 


-100% 
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3 


5.7650 x 10 6 


25.6% 


300:1 


H n 


4 


4.3560 x 10 6 


24.4% 


261:1 


U n 


5 


3.2440 X 10 6 


25.5% 


225:1 


lit* 


// 


// 


// 


// 


// 


11 


3.8200 x 10 5 




77:1 


mi 


12 


2.1200 x10 s 


44.5% 


58:1 





10 An assumption made in the above calculations is that the donor emission remains constant 

during the passage of one fluorophore through the donor fluorophore layer. Recall that the donor 
lluorophore layer may consist of a monolayer of fluorophores embedded in a clear non-conducting 
medium. The range of energy transfer for an acceptor close to the exit of the nanochannel is less 
than the range upon initial entry into the channel. This change is in fact significant, as shown 

15 mathematically below. Taking into account the change yields higher signal-to-noisc ratios and 
greater changes in donor intensity for additional acceptors. This means that the values in table 4 
which show very efficient signal generation/detection already are actually even slightly higher. 

The change in donor emission upon passage of an acceptor through the donor fluorophore 
layer is determined by calculation in the following formulas. It is expected that the amount of 

2t> energy transfer decreases as an acceptor passes through the donor layer due to a smaller effective 
energy transfer range. The consequence of this decrease is that the donor emission is greater than 
in the previous calculations. A higher donor emission means a higher SNR. Suppose an acceptor 
has entered the donor layer a small distance. A short time later, another acceptor enters the donor 
layer. The presence of an additional acceptor yields a decrease in the donor emission. The 

25 percentage change is large. In fact, it is larger than the previous calculations. The combination of 
higher SNRs and greater percentage changes mean that detection efficiencies are greater than those 
previously estimated. Complex patterns of labeling, such as labeling every several nucleotides, can 
be distinguished using this system. 

The equation of donor emission when the acceptor is on the same plane as the donor 
30 fluorophores is given as: 



BNSOOCID <WO 9935012A2 I > 



WO 98/35012 



-98- 



PCT/US98/03024 



mit n (Rdonor & channel) i k emii 71 ^ equal ^channel) + lo J R mquai * transfer 9 drtl 



Recall the equations for k cmi , and k transfer : 



m P N D , _ N D R 0 , 



k = _ u k = —(—) 

hvA transf " x^A [ r 

Express the original donor emission function in terms of the radial distance (x) and distance 
of acceptor from donor layer (d) with the following substitutions: 



dr = - dx r = y/x 2 + d 

Jx 2 + d 2 



emtt 



The resulting equation, together with the substitution of the equation for k lransfcr yields: 



N R 6 

iM R Lor - Rlanned-We m iMx 2 equQl ~ Runnel) + 2H-^- / " * 



Let u= x 2 + d 2 . It follows that dx = du/dx. 



*enuM R Lor ' banned ~ l^iM^uai ' R channel) ' ^ ^ ' 



k emi( T '{ R donor " R channel* ~ \ k emit 11 ^ equal ' R channel* + ( 271 ~ ~)("^~^ " ^ ^ 2 . J 
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Substitute x equal = \/R 2 equa | - d 2 and the resulting emission function becomes: 



N 6 

= ^"(«L - *LnnJ ~ ^ emU nR^, - d 2 - R^J) + (2Tt-^)(— L— - — ! )] 

donor ' 



The resuling donor emission function can be plotted versus distance. 



Distance 


Donor Emission 
(photons/s) 


% Change from Original 


0 


1.09425 x 10 7 




20A 


1.09740 x 10 7 


0.023 % 


40A 


1.0692 x 10 7 


1.158% 


60A 


1.12278 x 10 7 


2.607 % 


80A 


1.14501 x 10 7 


4.638 % 


100A 


1.17365 x I0 7 


7.256 % 



N 6 



equal 



In order to find solutions to the above equation, R equal needs to be found by equating k erT 
to the sum of energy transfer of the two acceptors. Solving for R cqiK1 , in the below equality is 
1 5 done by computer: 



Jt>pN n N n R N n R* 

k = k +k -t r D = ° ( ?_\ 6 + D 

K emit * transferi \) *transferm ^ _ ) 



emit transfer(l) transferal) , v ^ 
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The percent chage in donor emission from one to two acceptors is tabulated. 



Distance (d) 


Requai 2 acceptos 


E 2 (d) < 


E t (d) 


% change 


SNR 


OA 


39.28 


7.7510x10 8 


1. 0943x1 0 7 


29.10% 


348.0:1 | 


20A 


39.21 


7.7661x10 s 


1. 0974x1 0 7 


29.24% 


348.3:1 


40A 


39.11 


7.8102x10° 


1.1 069x1 0 7 


29.44% 


349.3:1 


60A 


39.01 


7.8815X10 8 


1.1228x1 0 7 


29.80% 


350.9:1 


80A 


38.82 


7.9790x1 0 8 


1. 1450x1 0 7 


30.31% 


353.1:1 


100 A 


38.610 


8.0962x1 0 8 


1.1 737x1 0 7 


31.01% 


355.7:1 



% change = ±L±xlOO 



Therefore base resolution down to individual bases can be achieved. A corollary to the base 
resolution argument is that the time spent in the donor layer by an individual acceptor can be 
determined. This translates into information about instantaneous rates of DNA movement. 



2-S 

t-S 
1 



20 



In order to achieve optimal linear crossing of a polymer across a channel it is important to 
consider the channel diameter as well as the method used to direct the linear crossing of the 
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polymer e.g., an electric field. The diameter of the channels should correspond well with that of 
the labeled polymer. The theory for linear crossing is that the diameter of the channels 
correspond well with that of the polymer. For example the ring-like sliding clamps of DNA 
polymerases have internal diameters that correspond well with the diameter of double-stranded 
5 DNA and are successful at achieving linear crossing of a DNA molecule. Many kilobases of 
DNA can be threaded through the sliding clamps. Several references also have demonstrated 
that linear crossing of DNA through channels occurs when the diameter of the channels 
corresponds well with that of the diameter of the DNA. (Bustamantc, 1991 ; Gurrieri et al., 1990; 
Matsumoto et ah, 1981). 

10 Single-stranded DNA, as used in the experiment, has a diameter of ~1 .6-nm. A channel 

having an internal diameter of approximately 1.7-3 run is sufficient to allow linear crossing of a 
single strand DNA molecule. The diameters of the channel and the DNA need not match exactly 
but it is preferred that they be similar. For double-stranded DNA which has a diameter of 3.4- 
nm, channel sizes between 3.5-nm and 4.5-nm are sufficient to allow linear crossing. 

15 As discussed earlier many methods may be used to move the polymer linearly across the 

channel and past the interaction station or signal generation station. A preferred method 
according to the invention utilizes an electric field. An electric field can be used to pull a 
polymer through a channel because the polymer becomes stretched and aligned in the direction 
of the applied field as has previously been demonstrated in several studies (Bustamante, 1991; 

20 Gurrieri et al., 1990; Matsumoto et al., 1981). The most related experiments regarding linear < 
crossing of polymers through channels arise from experiments in which polymeric molecules are 
pulled through protein channels with electric fields as described in Kasianowicz et al., 1996 and 
Bezrukov et al., 1994, each of which is hereby incorporated by reference. A brief description of 
these experiments is presented below in order to illustrate one method for enabling linear 

25 crossing of polymers. 

In a study entitled, "Characterization of individual polynucleotide molecules using a 
membrane channel," Kasianowicz et al., 1996 demonstrate the linear crossing of DNA molecules 
through protein channels in a lipid bilayer with an electric field (also described in PCT Published 
Patent Application WO 96/29593). An excerpt of the abstract follows. 

30 We show that an electric field can drive single-stranded RNA and DNA molecules 

through a 2. 6-nm diameter ion channel in a lipid bilayer membrane. Because the 
channel diameter can accommodate only a single strand of RNA or DNA, each 
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polymer traverses the membrane as an extended chain that partially blocks the 
channel. 

The assay is performed using Staphylococcus aureus -hemolysin as the membrane channel. 
It has a diameter of 2.6-nm and can remain open for extended periods of time, allowing 

5 continuous ionic current to flow across a lipid bilayer. The hypothesis is that while a DNA 
molecule traverses a channel, there should be blockage in the ionic flow. Using single channel 
recordings, the blockage time is recorded. The length of blockage time corresponds to the length 
of the single-stranded DNA molecule passing through the channel if linear crossing occurs. 
Initially a potential of -120 mV was applied across the membrane, causing a current flow. When 

10 single-stranded DNA was added, two consecutive conductance blockades of 300 jis and 1300 |as, 
respectively occurred. 

The data supports two hypotheses: 1) the length of blockage time (denoted as "lifetime") is 
directly proportional to the length of the DNA and 2) greater applied voltage shortens the 
blockage time for a given length of DNA., indicating that linear crossing indeed occurred. As a 
15 control, it was demonstrated that double-stranded DNA do not cross the protein channels. 

Bezrukov et al., 1994 have done similar studies with alamethicin pores in a paper entitled, 
"Counting polymers moving through a single ion channel" Alamethicin pores have internal 
diameters of 2 nm. The duration of the conductance blocks were proportional to the length of the 
polymer pulled through the pore, supporting the hypothesis for linear crossing. The results from 
20 Kasianowicz et aL, 1996 and Bezrukov et al., 1994 demonstrate that an electric field can drive 
DNA across a protein pore channel in a linear fashion. 

An experiment was performed to demonstrate according to the invention that DNA can pass 
though fabricated nanochannels. The results of the experiment are shown in Figure 1 1 . DNA in 
various forms was exposed to nanochannels to determine if it could pass through. Only double 

25 stranded linear DNA of 50 kb could pass through an array of 4 nm nanochannels. Folded DNA 
as modeled by a circular plasmid of the same size, cannot pass through the array. Lanes 1 and 2 
are the controls. Circular DNA migrates slower than linear DNA as expected. Lanes 3 and 4 
demonstrate that only linear DNA can pass through the nanochannel array. Since folded DNA 
which has a solvation diameter of approximately 5 nm, cannot pass through the array, the only 

30 means by which the linear DNA passes through the plate is in a linear fashion. 

Using the system described above by Kasianowicz et al., a randomly labeled polymer, 
such as DNA can be analyzed by passing the polymer through the channel and making unit 
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specific measurements. A channel can be prepared as a protein pore in a lipid membrane such as 
that described in Kasianowicz et al., and also in PCT published patent application WO/96/29593. 
Briefly, the S. aureus protein a-hemolysin is added to the cys side of a lipid bilayer. Lipid 
bilayers can be formed from, for example, diphytanoyl phosphatidylcholine by layering the 

5 solution on 0.2 mm holes in a Teflon film separating two compartments containing buffer 

solution. After the hemolysin is added, voltage can be applied across the bilayer and varied from 
0 mV to 140 mV. DNA which is randomly labeled is added to the buffer on the cys side of the 
protein. A voltage is applied which causes the labeled DNA to traverse from this cys to the trans 
side of the channel, which has a positive charge. As each unit passes through the channel, a 

10 change in conductance as a result of the blockage of the channel occurs. The change in 
conductance is dependent upon the size, shape and charge of the unit passing through the 
channel. If the unit is labeled, the conductance change will reflect the properties of the label. In 
this manner, labeled unit can be identified. This method can be used to identify a particular unit 
or to identify specific order of units or distance between units or simply the number of units 

15 which are labeled. 

Retrograde movement of the DNA is unlikely because of differences in frictional coefficients 
for inside the nanochannel and outside. The predicted van der Walls interaction between the 
solvated labeled DNA and the inside of the nanochannel creates a higher frictional resistance for 
the portion of the DNA located inside the nanochannel than that located in free solution. This is 
20 evidenced by the slower migration of DNA in the presence of a nanochannel plate (compare 
lanes 1 and 3 in figure 6). The differential in frictional resistance is likely to create a ratchet 
mechanism in the direct of the desired DNA movement. 

Another method for moving a polymer linearly past an interaction station or a signal 
generation station involves the use of a molecular motor. A molecular motor is a device which 

25 physically interacts with the polymer and pulls the polymer past the station. Molecular motors 
include but are not limited to DNA polymerases and helicases. DNA polymerases have been 
demonstrated to function as efficient molecular motors. Preferably the internal diameters of the 
regions of the polymerase which clamp onto the DNA is similar to that of double stranded DNA. 
Furthermore, large amounts of DNA can be able to be threaded through the clamp in a linear 

30 fashion. 

The overall structureof the p-subunit of DNA polymerase III holoenzyme is 80 A in diameter 
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with an internal diameter of -35 A. In comparison, a full turn of duplex B-form DNA is -34 A. 
The beta subunit fits around the DNA, in a mechanism referred to as a sliding clamp mechanism, 
to mediate the processive motion of the holoenzyme during DNA replication. It is well 
understood that the p-subunit encircles DNA during replication to confer processivity to the 
5 holoenzyme (Bloom ct al., 1996; Fu et al., 1996; Griep, 1995; Herendeen and Kelly, 1996; 

Naktinis et al., 1996; Paz-Elizur et al., 1996; Skaliter et al., 1996). Because the sliding clamp is 
the mechanism of processivity for a polymerase, it necessarily means that large amounts of DNA 
are threaded through the clamp in a linear fashion. Several kilobases are threaded through the 
clamp at one time (Kornberg and Baker, 1991). 

10 Methods for preparing the wall material of the invention are also encompassed by the 

invention. One method for preparing a wall material according to the invention includes the step 
of covalently bonding the agent to a plurality of discrete locations of a wall material. The agent 
is bonded to discrete locations on the wall material which are close enough to an interaction 
station, such that when an individual unit of a polymer, which is interactive with the agent to 

15 produce a signal, is positioned at the interaction station, the agent interacts with the individual 
unit to produce the signal. The discrete locations may be on the surface of the wall material or 
may be within the wall material such that the agent is embedded in the wall material. 

Another method is for attaching a chemical substance selectively at a rim of a channel 
through a wall material that is opaque. An "opaque" material as used herein is a material which 
20 is light impermeable at a selected wavelength. 

The wall material is provided with photoprotective chemical groups attached at the rim of the 
channel through the wall material. Light is then applied to the photoprotective chemical groups 
to dephotoprotcct the chemical groups, and a chemical substance is attached to the 
dephotoprotected chemical groups. 

25 A "photoprotective chemical compound" as used herein is a light sensitive compound which 

is capable of becoming chemically reactive when exposed to light. When light is applied to the 
photoprotective chemical groups the groups become dephotoprotected and are susceptible to 
interactions with chemical substances such as light emitting compounds and quenching 
compounds. 

30 Localized regions of agent can still be prepared even if the photoprotective chemical group 
covers the entire surface of the wall. The light may selectively be applied to regions of the wall 
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on which it is desirable to have the agent localized. For instance the light : may be applied only to 
the region of the wall surrounding the channel openings so that only those regions are 
dephotoprotected. When the agent is added it will only attach to the wall around the regions 
surrounding the channel openings. 

5 Additionally a method for preparing a wall material having localized areas of light emission 
on a surface of the wall material is provided. A light emissive compound is applied to the 
surface to produce at least localized areas of light emission on the surface. The light emissive 
compound may be added directly to the surface of the wall material or it may be attached 
indirectly to the surface through a photoprotective chemical group which has been attached to the 

10 surface and dephotoprotected by light. 

A "localized area of light emission" as used herein is a region of concentrated light emissive 
compound on the surface of the wall material which defines a target region around a rim of a 
channel through the wall material for detecting light emission. The localized area can be 
produced in several ways. Firstly, the light emissive compound may be attached directly to the 

15 wall surface only around the rim of the channel. Secondly, the light emissive compound may be 
attached to a photoprotective chemical group which has been attached selectively to localized 
areas around the rim of the channels. Alternatively the photoprotective chemical groups may be 
aJJed to the entire surface or to random regions of the surface of the wall material but only select 
regions are dephotoprotected by light to create localized regions to which the light emissive 

2o compound can be attached. Both the photoprotective chemical groups and the light emissive 
compound can be added to the entire surface or to random regions of the surface of the wall 
material when a mask is used to create localized regions of light emission. A mask having 
openings may be positioned over the wall surface such that the openings in the mask expose 
localized regions of the wall surface specifically around the openings of the channels. 

25 A wall material having localized areas of light emission on a surface of the wall material may 

also be prepared by first applying a light emissive compound to the surface to produce at least 
localized areas of light emission on the surface and then by creating a channel in the wall 
material wherein a rim of the channel forms a target region within the localized areas of light 
emission. 

30 An non-limiting example of a method for constructing an article of manufacture having a 

concentric ring of donor fluorophores around each channel is provided to demonstrate a preferred 
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method of the invention. The method is achieved by the use of photolabile protecting groups and 
light diffraction. A light impermeable polycarbonate porous medium is coated on one side with 
a dense layer of covalent linkers. The linkers are protected by photolabile protecting groups. 
Light is transmitted through the back side of the light impermeable polycarbonate. Since the 
5 wavelength of the deprotecting light source (400-500 nm) is much greater than the size of the 
channels (5 nm), each channel acts as a point source of light. The intensity of transmitted light is 
greatest closest to the channel. Accordingly, only those groups close to the channels are 
deprotected. Deprotected groups are free to react with donor fluorophores in subsequent 
chemical reactions to generate concentric rings. 

10 The radius of the concentric ring of deprotection is controlled through understanding the 

diffractive nature of light. The long wavelength of light and the small size of the channels set up 
a situation of very strong diffraction where diffracted angles are greater than 90°. Light does not 
pass through the light impermeable polycarbonate surface, but rather is forced to undergo 
diffraction through the channels. According to Huygen's principle, each channel acts as a 

15 secondary point source of light, spherically radiating. Huygen's principle states that all points on 
a wavefront can be considered as point sources for the production of spherical secondary 
wavelets. After a time / the new position of the wavefront will be that surface of tangency to 
these secondary wavelets. A direct consequence of Huygen's principle is the generation of 
secondary point sources at the exit ends of the channels. A further consequence is that the 

20 intensity of excitation light decreases with increasing distance from the center of each channel. 
The amount of light exiting the channel and the resulting spherical distribution of light 
intensity can be calculated. The power of light exiting the channel is given as the cross-sectional 
area of the channel multiplied by the intensity of the incident light (I Q ). 

P=I 0 KR channel 

Upon exit of the channel, the light becomes a spherically radiating point source. In this 
25 particular case, the light source is restricted to radiate to half a sphere (a surface area of 4 n 3^/2). 
The new intensities (I(r)) as a function of the radial distance from the center of the channel is 
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given as the power of light exiting the channels divided by the surface area of radiation at a given 
distance (r). 

j P _ ^ channel 

A 2r 

For a 60W light source 1 cm away from the back side of the light impermeable 
polycarbonate, the intensity decreases from 4.77 W/cm 2 to 0.43 W/cm 2 at a distance 35 nm 
5 (R^uai) fr° m the center of the channel, corresponding to a 91% intensity change. From the above 
equation, the radius of light around each channel can be precisely controlled by the intensity of 
the light source. 

Photosensitive protecting groups have been described in great detail (Pillai, 1980). 
Depending on the type of covalent linker (amine, hydroxyl, carboxylic acid, ketone, sulfhydryl, 

10 etc.), the corresponding photosensitive protecting group is available. For example, amino groups 
can be protected with nitroveratryloxycarbonyl (NVOC), 2-nitrobenzyloxycarbonyl, and ct- 
substituted 2-nitrobenzyloxycarbonyl groups. The latter two can also be used in the protection of 
carboxylic acids and hydroxyl groups. Photolytic deblocking of 2-nitrobenzyloxycarbonyl 
derivatives is straightforward, requiring only a 350 nm lamp and ethanol. The time of 

15 deprotection is controlled from 1 to 24 hours. 

The invention also encompasses an apparatus for detecting a signal. The apparatus provides 
a support for the article of manufacture and a sensor for detecting the signals generated by the 
interaction which occurs as the polymer traverses the interaction station. The apparatus includes 
a housing with a buffer chamber, a wall defining a portion of the buffer chamber, and having a 
20 plurality of openings for aligning polymers, and a sensor fixed relative to the housing, the sensor 
distinguishing the signals emitted at each opening from the signals emitted at the other of the 
openings to generate opening dependent sensor signals. 

An "opening dependent sensor signal" is a signal which arises at an opening in the wall 
material as a result of an interaction between a polymer and the station. 

25 The wall within the housing defines at least one and preferably two buffer chambers. A 
"buffer chamber" as used herein is area which is capable of supporting a liquid medium. The 
two buffer chambers may be in fluid communications with one another. 
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The wall has a plurality of openings formed by channels within the wall. A "plurality of 
openings" within the wall means at least two openings formed by at least two channels. 
Preferably a plurality of openings is at least 50 openings. 

A "sensor" as used herein is a device which responds to a physical stimulus and transmits a 
5 resulting impulse in the form of a signal. Sensors include but are not limited to optical sensors, 
temperature sensors, pressure sensors, auditory sensors, magnetic sensors, electrical, mechanical, 
radioactive and motion sensors. Preferably the sensor is an optical sensor. As used herein an 
''optical sensor" is a device which detects and converts input electromagnetic radiation signals an 
impulse. The impulse can be measured and stored as data. Optical sensors are well known in the 
10 art and include microscopes. The sensor is fixed relative to the housing such that the sensor is 
capable of detecting signals generated at the interaction station. It is not necessary that the 
sensor be secured or attached directly to the housing. 

A microprocessor as used herein is a device for collecting and storing sensor signals. In 
general a microprocessor is a chip containing several electronic components such as ROM, 
1 5 RAM, registers, and I/O controls. Conventional microprocessors are well known in the 
electronics arts. 

An example of an apparatus constructed to hold a nanochannel (or microchannel) plate (120) 
which is capable of generating an electric field is presented in Figure 12. The electric field, 
created by electrodes (128, 130), is used to draw the DNA through the nanochannels. The 

20 exemplary nanochannel plate is immersed in a slightly viscous buffer solution which helps to 
slow the transit of the polymer through the nanochannel, so as to allow for a longer signal 
collection time per base. In addition, on either side of the plate are electrodes (128, 1 30) 
immersed in the buffer solution. The ensemble of nanochannel plate, buffer compartments (122), 
and electrodes are contained in an optical quality glass chamber suitable for image analysis and 

25 are positioned adjacent to a 60x1 .4NA oil objective (126). 

As discussed above the use of an electric field to cause the polymer to move linearly through 
a channel is preferred. The use of an electric field is suitable because the stretched, linear 
orientation of a polymer in an electric field is favorable for linear crossing of nanochannels. 
Furthermore, the rate of polymer movement can be controlled by voltage. Lastly, an electric 
30 field does not adversely affect FRET. 
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Light microscopy (Bustamante, 1991; Gurrieri et al., 1990; Matsumoto et al., 1981 , 
Rampino and Chrambach, 1990; Schwartz and Koval, 1989; Smith et al., 1989), linear dichroism 
(LD) (Akerman et al., 1990; Akerman et al., 1985, Moore et al., 1986), fluorescence-detected LD 
(Holzwarth et al., 1987), and linear birefringence (Sturm and Weill, 1989; Chu et al., 1990) can 
5 be used to study the instantaneous changes in shape of DNA molecules undergoing gel 
electrophoresis. In these studies DNA is shown to be strongly oriented and stretched. 

Guirrieri et al., 1990 has demonstrated linear and stretched conformation of DNA molecules 
in an electric field. In each of the cases, the DNA molecule is clearly aligned in the direction of 
the applied electric field. The method used to visualize the DNA molecules combines 
10 fluorescent DNA labeling and use of an image intensificr-video camera system (Bustamante, 
1991; Houseal et al., 1989; Morikawa and Yanagida, 1981; Matsumoto et al., 1989; Yanagida et 
al., 1983). The DNA molecules shown are T2 molecules which are 164 kbp long. 

The orientation of DNA in an electric field has been well studied with linear dichroism and 
electric dichroism (Ding et al., 1972; Yamaoka and Charney, 1973; Colson et al., 1974; Hogan et 
15 al., 1978; Priore and Allen, 1979; Yamaoka and Matsuda, 1981; Wu et al., 1981). In fact, the 
first studies done on DNA orientation have been performed with these two techniques. DNA 
was first studied in solution and then subsequently in electrophoretic gels. Studies both in 
solution and in gels yield similar results in that the DNA molecules are indeed oriented in the 
direction of the electric field. 

20 Native DNA exhibits a negative UV linear dichroism (LD) as a result of the preferential 

orientation of the nucleotide bases nearly perpendicular to the field. The orientation of DNA has 
been attributed to the presence of either a permanent or an induced dipole moment. For example, 
Ding et al., 1972 describe their observation of DNA in a Tris-cacodylate buffer diluted in cthanol 
to give 80% v/v ethanol. 

25 LD data from single electric pulse of 9 V/cm. Duration of pulse is indicated by the 

horizontal bar. At steady-state, the plateau of the LD r reads close to -1 .5, which means that the 
DNA is oriented in the direction of the electric field (Akerman et al., 1990). 

Akerman et al., 1990 have performed LD studies on pulsed-field gel electrophoresis and 
similar results are obtained as in solution. Akerman provides a plot of LD r versus time. LD r 
30 represents orientation in the electric field. A value of -1 .5 means that the DNA molecule is 
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oriented in the electric field. A horizontal bar on the x-axis denotes the duration of the electric 
pulse. The DNA-is in 1% agarose. A short time after the beginning of the pulse, the curve 
plateaus at a LD r close to -1 .5, meaning that the DNA is oriented a short time after the beginning 
of the pulse. When the pulse is turned off, the DNA no longer orients itself in the field and the 
5 LD r curve no longer remains at -1.5. 

The rationale for DNA orientation in electric fields is based on either a permanent or induced 
dipole moment in DNA. Recall that dipoles align themselves in the direction of electric fields to 
minimize torque. Hogan et al., 1978 have proposed a rationale for the induction of a dipole 
moment in DNA that relates to anisotropic ion flow: 

10 In order to explain the dependence of the dichroism on the electric field, the ionic 

strength of the medium, and the length of the macromolecule, we propose a new model in 
which anisotropic ion flow produces an asymmetric ion atmosphere around the 
polyelectrolyte, resulting in an orienting torque. 

The conclusion from the discussion on DNA alignment in electric fields is that DNA molecules 
15 and other polymers align themselves in the direction of electric fields whether in an 

elcctrophoretic gel or in solution. Solution studies were performed before electrophoretic 
studies. The implications of DNA alignment in an electric field further support the fact that 
DNA molecules and other polymers can be driven across nanochanncls in a linear fashion. The 
unfolded orientation of the DNA is also important in the linear crossing of DNA molecules. As 
20 described above DNA and other polymeric molecules can be driven through protein channels in a 
linear fashion with an electric field. The data described herein rationalize this further and give a 
theoretical basis to why DNA molecules can behave in such a fashion. Given that DNA has been 
shown to pass through protein pores in a linear fashion and that DNA molecules align themselves 
in the direction of electric fields, linear crossing of DNA through nanochannels can be achieved. 

25 The rate at which a polymer moves is also important because the duration of energy transfer 
is important. The longer an acceptor remains in the donor fluorophore layer, the greater the 
signal generated. Millisecond integration times allow for unequivocal signal detection. Since 
the layer of donor fluorophore is roughly 40 A, the rate of polymer movement that is needed can 
be approximated as 40 A/ 10 ms, or 4000 A/s. In nanochannel FRET sequencing, the proposed 

30 rate is achieved by controlling either the voltage of the applied electric field or the frictional 
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coefficient of the polymer molecule. 

The rate of DNA movement can be determined given the equation that defines the movement 
of a DNA strand in an electric field (Tinoco et al., 1995): 

ZeE 
u - 

/ 



where Z is the (unitless) numbers of charges, e = 1 .6022x1 0" 19 coulombs, E is the electric field in 
5 colts per m, and /is the frictional coefficient in kg/s. The velocity of motion this depends on the 
magnitude of the electric field E, the net charge of the molecule, and the size and shape of the 
molecule as characterized by its frictional coefficient,/ The net charge on the molecule is 
designated by Ze. The frictional coefficient can be determined from the following equation 
(Tinoco etal., 1995): 

/ = kf o = k(6nr)r) 

10 where r| is the viscosity coefficient, /* is the radius for a sphere having the same volume as the 
DNA, A: is a shape factor (which for a strand of DNA is about 1 .7), and / is the frictional 
coefficient for a sphere having the same volume as the DNA. 

Rationally, it can be seen that either the frictional coefficient or the magnitude of the electric 
field can be used to control the rate of polymer movement. For different voltages in a given 

15 system, there are different polymer migration rates. In a similar fashion, larger molecules or 
molecules in more viscous media have slower molecular mobility. From examples in gel 
electrophoresis, calculations of the effects of electric field strength, polymer size, and frictional 
coefficient are possible. It is important to understand a viscous medium can achieve the same 
desired effects on the frictional coefficient as a gel. Since the frictional coefficient is inversely 

20 proportional to the electrophoretic rate, doubling viscosity decreases electrophoretic rate by two- 
fold. 

Electrophoresis in viscous solvents has been performed by a number of groups (Chang and 
Yeung, 1995; Bello et al., 1994; Jumppanen and Riekkoloa, 1995; Sahota and Khaledi, 1994; 
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Klenchin et al., 1991; Harris and Kell, 1985; Korchemnaya et al., 1978). Sahota and Khaledi, 
1994 demonstrate electrophoresis in formamide, which has a viscosity of 3.3 cP at 25 °C, triple 
that of water (0.89 cP, 25 °C). Electrophoresis in formamide is more favorable than in water 
because formamide has a higher dielectric constant than water and can solubilize many 

5 supporting electrolytes. It is commonly used in mixtures with water in electrophoretic buffers. 
Mixed glycerol solutions, N-methyl-2-pyrrolidone, acetic acid, and TV-methyl formamide have 
also been used (Korchemnaya et al., 1978). A large range of viscosities are possible, ranging 
from 1 cP to as high as 25 cP (Bellow et al., 1994). The actual viscosity of the solution needed 
depends on the frictional coefficient of polymer molecules in nanochannels. The net frictional 

10 coefficient thus depends on viscosity and also the frictional effects of the nanochannels on the 
polymer. From the small bore of the nanochanncl, there are frictional effects from solvent 
trapping between the walls of the nanochannels and the labeled polymer. The magnitude of these 
solvent trapping effects is similar to those arising from gel electrophoresis. As a result, only a 
slightly viscous solution may enhance the desired rate of polymer movement. 

15 For a given electric field, the electrophoretic mobility for different sized molecules can be 

determined by knowing the relationship between molecular weight and distance migrated, given 
as: 

log M - a ~ bx 

where M is the molecular weight of the nucleic acid and x is the distance migrated (proportional 
to mobility), a and b are constants for a given electric field. 

20 To summarize, the rate of polymer movement in an electric field can be controlled by the 
electric field, frictional coefficient, and molecular size. Desired values for the rate of polymer 
movement are readily achievable. 

There are no adverse effects of an electric field on FRET. Since energy transfer is related to 
short range electrical dipole interactions, the presence of an outside electric field may negatively 
25 affect the ability of donor-acceptor pairs to undergo FRET. The presence of an external electric 
field has no adverse effects on FRET as seen from recent studies performed with FRET primers 
and gel-based automated DNA sequencing (Glazer and Mathies, 1997; Hung et al., 1996; Ju et 
al., 1996a; Ju et al., 1996b; Marra et al., 1996; Wang et al., 1996; Wang et al., 1995). FRET 



n\icnnr.m --W^ 



WO 98/35012 PCT/US98/03024 

-113- 

primers are used in the labeling of sequencing reaction products because of the large Stoke' s 
shift and thus greater fluorophore discrimination. In these experiments, the FRET labeled DNA 
fragments are run on a gel and concurrently, in the presence of the electric field, the fragments 
are detected with a CCD camera and laser. Either a four-color capillary electrophoresis system 
5 or Applied Biosystems 373 A sequencer is used. These experiments demonstrate that energy 
transfer occurs in the presence of a strong electric field. 

An exemplary design of a nanochannel apparatus is shown in Figure 13, which consists of 
two fused Pyrex cells (132, 134) that hold the nanochannel plate (140). The upper (136) and 
lower (138) buffer regions are contained in the smaller (134) and larger (132) Pyrex cells, 

10 respectively. The buffer regions are sealed off with glass coverslips (146) that have been 

modified with indium tin oxide (ITO). Indium tin oxide is a clear conductor which functions in 
this system as electrodes (142). The electrodes lead to a variable voltage supply (152) consisting 
of a load resistance R L (one to several G , to approach a constant current system) and a rheostat- 
controlled voltage from a voltage source. A voltmeter is connected in parallel. Under the glass 

15 coverslip an oil (144) coated immersion objective (150) is positioned to detect fluorescence 
signals (148). 

The polymer is loaded into the upper buffer region and the electric field is used to drive the 
polymer to the lower chamber. The buffer has the desired viscosity and electrolytic properties. 
Epiillumination and signal collection are possible through the same 60x, 1.4 NA oil immersion 

20 objective. The dimensions of the apparatus are similar to that of a conventional microscope slide 
(75 mm x 25 mm) for ease of mounting. The gap between the nanochannel plate and the glass 
coverslip is 0.4 mm. A thin coverslip with a thickness of 0.1 mm is shown. These dimensions 
are preferred because long working distance objectives are available. For instance, the CFI 60 
system from Nikon has 60x, 1 .4 NA objectives that can examine specimens over 0.6 mm thick. 

25 A temperature control block is used to prevent Joule heating and thermal fluctuations. The 
block can surround the nanochannel apparatus and has inlet/outlet ports for coolant flow. The 
block is hollow to allow uniform temperature control of the nanochannel apparatus. There are 
inlet/outlet ports on either side of the apparatus that allow the recycling of the coolant through 
the hollow chamber. 

30 Several groups have used specialized apparatuses for fluorescence microscopy observations 
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of electrophoresis (Rampino and Chrambach, 1991; Matsumoto et al., 1981; Smith et al., 1989; 
Bustamante, 1991; Giirrieri et al., 1990; Schwartz and Koval, 1989; Smith et ah, 1990). The 
apparatus used by Smith et al., 1990 for the observation of DNA molecules undergoing pulsed- 
field gel electrophoresis is described: 

5 A 20-jal drop of molten agarose was placed between two microscope coverslips (24 mm x 

24 mm) and allowed to cool for 10 minutes. This sandwich arrangement was placed on 
top of a regular microscope slide and sealed at the four corners with fingernail polish. 
The slide had been previously prepared with four copper electrodes fixed on it with epoxy 
adhesive. This apparatus was then refrigerated at about 5°C for 30 minutes. Molten 

10 agarose was dropped over the electrodes to complete the electrical connection, and TBE 
buffer (0.5x) was occasionally added to keep it wet. 

Rampino and Chrambach, 1991 describe a more complex apparatus that has larger buffer 
regions, platinum strip electrodes, and a region for coolant flow. The entire apparatus is 
1 5 mountable on the microscope stage. The actual gel itself is sandwiched between two glass 

coverslips and placed in the center of the apparatus, in contact with buffer blocks on either side. 
The design allows for a homogenous electric field and temperature control. In contrast to the 
apparatus described above, the Rampino and Chrambach apparatus is designed for standard and 
not pulscd-field gel electrophoresis. 

20 Briefly, a description of how the apparatus (shown in Figure 13) of the invention functions in 

relation to polymer is described below. A polymer such as DNA is labeled (intrinsically or 
extrinsically) by the methods of the invention or any other method known in the art. The labeled 
DNA is placed in a buffer solution, which is preferably slightly viscous. The buffer is added to 
the upper buffer chamber 136. An electric field is created using the electrodes and the DNA is 

25 caused to enter the nanochannels of the nanochannel plate (1 40) in a linear manner. As the 

labeled DNA emerges from the other side of the nanochannel plate the label is caused to interact 
with an agent or the environment surrounding the channel to produce a signal or polymer 
dependent impulse which is detected by the detection device. The detected signal or polymer 
dependent impulse is stored and processed to reveal structural information about the polymer. 

30 A final aspect of the invention encompasses a method for preparing nucleic acids for use 

according to the methods of the invention as well as for any other use in which it is desireable to 
utilize randomly labeled nucleic acids. The method involves the steps of contacting a dividing 
cell with a nucleotide analog, isolating from the cell nucleic acids that have incorporated the 
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nucleotide analog, and modifying the nucleic acid with incorporated nucleotide analog by 
labeling the incorporated nucleotide analog. A "nucleotide analog" as used herein is a molecule 
which can be substituted for A, T,.G or C but which has a modified structure. Nucleotide 
analogs include for example but are not limited to a brominated analog, 2,4-dithiouracil, 2,4- 
5 Diselenouracil, hypoxanthine, mercaptopurine, 2-aminopurine, and selenopurine. The 

incorporated nucleotide analog may be labeled with an agent selected from the group consisting 
of a light emitting compound, a quenching source and a fluorescence excitation source. 

A dividing cell is "contacted" with a nucleotide analog by any means known in the art for 
incorporating nucleotides into living cells. For instance the nucleotide analog may be added to 

10 the cell culture medium and taken up by the cell naturally. In order to optimize the uptake of the 
nucleotide analog the dividing cell may be growth arrested using conventional means prior to 
adding the nucleotide analog to the medium and then the arrest removed to allow the cells to 
reenter the cell division cycle once the nucleotide analog is added to the medium. The nucleic 
acids having the incorporated nucleotide analog may then be isolated after the cells have 

15 reentered and completed at least one cell division cycle. 

An example of a method for incorporating a nucleotide analog into DNA is provided in Bick 
and Davidson, Proc. Nat. Acad. ScL, 71:2082-2086 (1974). Bick and Davidson grew the BrdU- 
dependent cell line, B4, in a basic growth medium of Dulbecco's modified Eagle's medium 
supplemented with 10% fetal-calf serum (E medium) containing 0.1 mM.BrdU. 100-mm Falcon 

20 plastic tissue culture dishes were inoculated with 10 6 B4 cells in E medium containing 0.1 mM 
hypoxanthine, 0.4 jiM aminopterin, and 10 |iM BrdU (E-HAB medium) and the cells were 
passaged three times at high density (5 X 10 5 cells per 100-mm dish) in E-HAB medium over six 
weeks. The cells were then plated at low density (1000 cells per 6-mm dish) in E-HAB medium 
and three weeks later, approximately 10 large colonies were observed in each dish. The cells 

25 from one dish were harvested and maintained as a new cell line called HAB. Because of the 
expected photosensitivity of BrdU-containing cells, the HAB cells were at all times protected 
from environmental lighting. After 100 cell generations in E-HAB medium, an aliquot of HAB 
cells was transferred back to E medium. A new subline of cells, called HAB-E, was isolated and 
maintained in E medium. 

30 Bick and Davidson grew the cell in the media (described above) to which was added H 3 32 P0 4 



BNSDOC1D: <WO 



99350 12A2 I > 



WO 98/35012 PCI7US98/03024 

- 116- 

(New England Nuclear Corp.) at an activity of 2 jiCi/ml. After 2-3 population doublings the 
cells were harvested and DNA was isolated as described in Davidson and Bick (Proc. Nat. Acad. 
ScL, 70:138-142 (1973)) except that after the first phenol extraction, DNA preparations were 
routinely treated with RNase A at a concentration of 50 \xg/ml for 60 min at 37 After a second 
5 phenol extraction, the DNA was extensively dialyzed against 10 mM Tris-HCl, 10 mM pH 7.6, 
1 mM EDTA, and then finally dialyzed against 10 mM Tris-HCl pH 7.6, 1 mM EDTA. Other 
methods for isolating DNA are well known in the art. See e.g., Sambrook. et al., Molecular 
Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory Press, 1989). 

The method of the invention for preparing nucleic acids is referred to hereinafter as in vitro 
10 base specific labeling (IBS A). IBS A involves the culture of cells derived from a subject in the 
presence of nucleotide analogs. The nucleotide analogs are either capable of subsequent 
chemical modifications, have a high molar extinction coefficient in the donor emission 
wavelengths, or have linkers for the attachment of acceptor labels. The most important analogs 
are the ones which contain functional groups that do not interfere with base pairing during DNA 
15 replication. The 8-substituted purines, 5-substituted pyrimidines, and 6-substitutcd pyrimidines 
satisfy this criteria. 

A non-limiting example of an IBSA scheme is set forth in detail below. The method is 
outlined in Figure 14 and covers the steps from initial DNA isolation to genomic labeling and to 
final sequence analysis. Cells are isolated from the subject and then grown. At a time when 

20 there are enough cells to provide adequate genetic material, the cells are arrested in the cell cycle. 
The arrest can occur at any point in the cycle as long as it does not interrupt the S phase. 
Nucleotide analogs that can be easily chemically derivitized, such as 8-bromo-2 , -deoxypurines 
and 5-bromo-2'-deoxypyrimidines, are added to the growth medium. The cells are then grown 
for one cycle and the genomic DNA from the cells are isolated. The resulting DNA is a chimera 

25 of labeled and unlabeled DNA, a product of semiconservative replication. The advantage in 

having the DNA with only one strand labeled is so that the resultant duplex DNA can be used in 
a sequencing method of the invention. 

The genomic DNA is broken down into sizes of approximately 100 kb. The genomic DNA is 
chemically labeled and end-labeled at this point. The advantage of culturing cells in brominated 
30 analogs is that such a modification activates the bases for further modification, the addition of 
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acceptor labels to the bases. For a human genome, there are approximately thirty thousand 
fragments. The fragments are analyzed simultaneously by the sequencing methods of the 
invention. Fragments of similar size and potential sequences are grouped together and 
population analyses are performed to generate DNA sequence information. Sequencing of a 
5 genome can be completed in approximately six hours. The time from isolation of a subject's cell 
to the end of sequencing is approximately one week. The following paragraphs describe each 
individual step in detail. 

The first step to the outlined scheme is cell culture and cell cycle arrest. The purpose of the 
cell cycle arrest is to generate a synchronous population of cells which can undergo DNA 

lo replication at the same time to ensure all the resultant DNA will be a chimera of labeled and 
unlabeled DNA. There are many methods to synchronize cells including methods which arrest 
cells at cither M or S phase. Only an M phase block would produce the desired genomic 
chimeras. Examples of metaphase inhibitors include nitrous oxide under pressure, vinblastine 
(Marcus & Robbins, 1963), and colcemid. Colcemid blocks cells in metaphase but can be 

15 reversed by washing (Mitchison, 1971; Kato & Yoshida, 1970; Stubblcfield, 1968). Cells are 
thrown for a few hours in colcemid, resulting in a synchronous culture (Wunderlich & Peyk, 
I 4 >6 4 >). Nitrous oxide under pressure (Rao, 1968) is advantageous in that it can be easily 
removed. Removal of the block and addition of the desired brominated base analog allows for a 
synchronous entrance into S phase. 

20 The nucleotide analogs which can be used include but are not limited to S-bromo-2 1 - 

deox> adenosine, 8-bromo-2'-deoxyguanosine, 5-bromo-2 , -deoxycytidine, and 5-bromo-2'- 
dcox\ uridine. These brominated nucleotide analogs are activated for coupling to nucleophiles 
( Iraincard et al.., 1983; Sakamoto et al., 1987; Keller et al., 1988; Hermanson, 1996). Coupling 
nt 'amine-conlaining acceptor labels and fluorophores to the base analogs is done at ambient 

25 temperatures (35°C). The final positions of the acceptor labels are such that the base-pairing of 
the labeled fragment is not affected. 

Brominated nucleotide analogs are commercially available (Sigma Corp., Fluka Inc., Fisher 
Scientific, Inc.) or can be prepared by reaction with brominating reagents. Hermanson, 1996 
outlines a protocol for bromination of DNA at thymines, cytidines, and guanosines using N- 
30 bromosuccinimide (NBS). Adenine residues are prepared from other methods which do not 
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require NBS. For an example, 8 -bromo-2'-deoxy adenosine and 8-bromo-2'-deoxyguanosine can 
be synthesized by reaction of the native base with bromine water (Ikehara et al., 1969a; Ueda et 
al., 1974; Faerber & Scheit, 1971; Kochetkov et al., 1968). Bromination at the eighth position of 
the purines can also achieved via the nucleotide morpholidate (Lang et al., 1968; Ikehara et al., 
5 1969b). The solvent condition for bromination is especially important. Bromination of 2'- 
deoxyadenosine does not take place in dimethylformamide at 0°C, but if the reaction is 
conducted at 50~60°C in glacial acetic acid, 8-bromo-2'-deoxy adenosine is formed relatively 
easily (Gueron et al., 1967; Rahn et al., 1965). 

Brominatcd and other base analogs have been demonstrated to be efficiently incorporated 
10 into genomic DNA during cell culture (Balzarini et al., 1984). A particular example is the 
incorporation of 5-bromo-2'-deoxyuridine into cells used for flow cytometry. 5-bromo-2'- 
deoxyuridine is incorporated into newly synthesized DNA in the place of thymidine (Crissman et 
al., 1990; Foot & Hoehn, 1990; Bohmer, 1990; Gaines et al., 1996; Nicolas et al., 1990). The 
percentage of substitution of base analog can be very high as according to Bick and Davidson, 
15 1 974 in their paper discussed above and titled, "Total Substitution of Bromodeoxyuridine for 
Thymidine in the DNA of a Bromodeoxyuridine-Dependent Cell Line. The following is a 
paragraph from this paper: 

To obtain total substitution, BrdU-dependent cells were exoposed to culture medium 
containing aminopterin, a powerful inhibitor of thymidine biosynthesis, and BrdU in the 
20 absence of added thymidine... We report here the isolation of a cell line in which at least 

99.8% of the thymidine in the nuclear DNA has been replaced by BrdU. 

In the current proposed random one-base labeling scheme, percentages as high as 99.8% 
substitution are not necessary but such data gives insight into possibilities of cellular 
incorporation of base analogs. Other brominated base analogs can be similarly incorporated into 
25 mammalian cells with ease (Stewart et al., 1968). A few examples of non-bromine base analogs 
that have been incorporated in this manner include 2-aminopurine (Glickman, 1985), 5- 
propynyloxy-2'-deoxyuridine, and 5-ethynyl-2'-deoxyuridine (Balzarini et al., 1984). 

After generation of the appropriately brominated DNA chimeras through cell culture, the 
chimeric genomic DNA is prepared for analysis by performing several steps, including an 
30 optional step of chromosome sorting. 100 kb DNA fragments can be generated from the partial 
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digestion of flow sorted chromosomes. The optional flow sorting step has been covered in detail 
by many manuals and textbooks (de Jong et al., 1989; Fawcett et al., 1994) and will not be 
discussed in detail. In brief, the cells are arrested in metaphase by the addition of colcemid. 
Several million of the cultured cells are gently lysed to release the metaphase chromosomes into 
5 a suspension buffer in which they are stained with one or more fluorescent dyes. The target 
chromosomes are then identified by their pattern of fluorescence emission, using either a single 
fluorophore in a one laser system, or two fluorophores in a two laser detection system. The 
sorted chromosomes are then prepared and partially digested as according to protocols described 
by Glover and Hames, 1995 in DNA Cloning 3. Sizes on the order of 100 kb arc obtained. 

10 Fluorophore labeling of the brominated DNA can be done either before or after partial 

digestion. The base bromination activates the bases for nucleophilic displacement. Many 
different fluorophores with amine/hydrazine groups can be used including the popular dyes 
fluorescein, coumarin, eosin, rhodamine, and Texas Red (Molecular Probes, OR). These have 
absorptions and emissions in the visible region. The amine group on these fluorophores attack 

15 the 8 position of the purines and the 5 position of the pyrimidines, undergoing a nucleophilic 
displacement reaction. The experimental protocol for the displacement reaction is described by 
Hermanson, 1996. A modified outline of the labeling scheme described is given below. 

Protocol for labeling Brominated DNA 





1. 


Dissolve a hydrazine or amine-coupled fluorophore in water at a concentration 


20 




of 80-1 00 mM. 




2. 


Add 25 ul of the fluorophore solution to a bromine-activated DNA solution. 




3. 


React for 1 h at 50°C. 




4. 


Purify by ethanol precipitation. Add 20 ul of 4 M LiCl and 500 pi of ethanol 
'\ (chilled to -20°C). Mix well. 


25 


5. • 


Store al -20°G for 30. min, then separate the precipitated DNA by 
centrifugation at' 1 2,000 g. 




6. 


Remove the supernatant and wash the pellet with 70% and 100% ethanol. 
ccntrifuging after each wash. 



Redissolve the labeled DNA pellet in water and store at -20°C. 
30 From the genomic partial digestion, approximately thirty thousand 1 00 kb fragments are 



created. The fragments are end-labeled either with terminal transferase or through ligation to 
short labeled polynucleotides. The fluorescent labels incorporated by end-labeling are uniquely 
identified either by different spectral absorbances or a specific sequence of labels (as in the 
ligation scheme). After end-labeling, the prepared DNA is subject to nanochannel FRET 
35 sequencing as described above. The data is sorted by fragment size and potential sequence. 
Through population analysis, the DNA sequence is generated. 
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Figure 1 5 presents the general scheme for deciphering DNA sequence from IBSA labeling. 
A sample of genomic DNA has two of its bases labeled (in this case A and C). The end-labeled 
and internally labeled DNA fragments are analyzed by FRET sequencing. The resulting data is 
sorted by fragment size and potential sequence. Potential sequence is defined as the pattern of 

5 sequence-specific FRET signals generated for a particular fragment of DNA. A potential 

sequence does not give sequence information, but rather allows DNA molecules to be uniquely 
identified. Population analyses are performed to determine all the positions of the A's and C's 
on each of the fragments. Complementary strands are paired. The process of determining base 
position is repeated for only one of the bases (A). Comparison of the resulting data for one 

10 labeled base and two labeled bases generates the DNA sequence. 

Hundred kilobase fragments are not always desired for sequencing. Expression mapping and 
single gene sequencing require much shorter labeled fragments. The generation of shorter 
labeled fragments can be achieved by nick translation, primer extension, and the polymerase 
chain reaction (PCR). In addition, smaller fragments can also be analyzed by a modification of 
15 the Sanger reaction and the use of terminal transferases. With enzymatic methods, large 
nucleotide analogs can often be incorporated (Hermanson, 1996). 

In primer extension, modified nucleoside triphosphates are added to a DNA template using a 
defined amount of the desired primer. The form of polymerase used is the Klenow fragment, 
which lacks the 5*-3 5 exonuclease activity of intact E. coll DNA polymerase I (Kessler et ah, 

20 1990; Feinberg and Vogelstein, 1983, 1984). Equivalent enzymes are the Sequenase 2.0 and T7 
polymerases. This method is a simple method of preparing internally labeled target DNA. Using 
the method, synthesis of large internally labeled DNA fragments with lengths greater than 10 
kilobases have been synthesized with one of the nucleotides replaced by a modified base. A 
brief summary of the protocol involves using Sequenase 2.0 (Amersham). The annealing 

25 reaction consists of 1 ul primer (0.5 pmol/ul) ? 2 ul Sequenase buffer (5X), 5 [ig plasmid DNA, 
and 10 mI sterile distilled water. After incubation in a 65 °C waterbath for 2 minutes, annealing is 
allowed to occur at 30°C for 30 minutes. The actual reaction mixture consists of 10 ^1 of 
annealing reaction (primer-template), 1 \i\ dithiothreitol (0.1 M), 1 }iM of each dNTP, 0.1 |aM of 
a fluorophore dNTP, and 0.025 U/^l of Sequenase. After mixing, the reaction is allowed to 

30 proceed at room temperature for 1 5 minutes. 
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Using primer extension, Ambrose et al. (1993) and Harding and Keller (1992) have 
demonstrated the synthesis of large fluorescent DNA molecules. In their experiments, complete 
replacement of one pyrimidine with a fluorophore analog was achieved in DNA longer than 7 kb. 
Fluorescent DNA between 5 to 7 kb in length, in which two pyrimidines are completely labeled, 
5 have also been reported by the same authors. Large fragments using other unusual base analogs 
such as 7-(2-oxoethyl)guanine, 2'-amino-2'-deoxycytidine, 8-oxopurine, and N-4- 
aminodeoxycytidine have been synthesized as well (Barbin et al., 1985; Purmal et al., 1994; 
Aurup et al., 1994; Negishi et al., 1988). 

Nick translation takes advantage of the ability of Escherichia coli DNA polymerase I to 
10 combine the sequential addition of nucleotide residues to the 3'-hydroxyl terminus of a nick 
[generated by pancreatic deoxyribonuclease (Dnase) I] with the elimination of nucleotides from 
the adjacent 5'-phosphoryl terminus (Meinkoth & Wahl, 1987; Rigby et al., 1977; Langer et al.. 
1981; Holtke et al., 1990). Many modified bases have been incorporated by nick translation 
(Meffert and Dose, 1988; Gebeyehu et al., 1987; Gillam and Tener, 1986). DNA polymerase I 
15 causes breaks to be filled as rapidly as they are formed, incorporating the desired nucleotides into 
the original strands. Since a quantity of labeled nucleoside triphosphates are present during the 
reaction, the labels get incorporated and the parent strands are modified. Nick translation 
involves a mixture of the double-stranded DNA target, 1 \x\ of DNAse 1 at a concentration of 2 
ng/ml, 1 |il each of three types of unmodified dcoxynucleoside triphosphates (dNTPs at 100 \\M 
20 concentration), 1 |il of a labeled dNTP (at 300 |iM), 32 \i\ water, and 1 \x\ of DNA polymerase 
containing 5-10 units of activity. Reaction proceeds for 1 h at 15°C. Reaction is quenched by 4 
M l of 0.25 M EDTA, 2 \x\ of 10 mg/ml tRNA, and 150 \x\ of 10 raM Tris, pH 7.5. Labeled DNA 
is purified by ethanol precipitation (Bethesda Research Laboratories, New England Nuclear, and 
Amcrsham). 

25 Direct PCR labeling not only incorporates the desired label into the target DNA, but also 

amplifies the amount of labeled DNA (Saiki et al., 1985,1988). PCR uses the heat-stable forms 
of DNA polymerase, most commonly, the Taq polymerase from Thermus aquaticus. Tag 
polymerase has the capability to incorporate labels into growing DNA copies with each cycle of 
PCR. In this manner, DNA fragments up to 3 kb can be labeled. Wiemann et al., 1995 describe 

30 the method of internally labeling PCR products with fluorescein- 15-d ATP, a protocol that can be 
applied to other fluorophores. The reaction mixture for PCR incorporation of fluorophores 
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consists of 1 |iM of each primer, 200 \iM of each dNTP, 10 jiM of a fluorophore-dNTP, and 
0.025 U/[xl of Taq polymerase (Perkin-Elmer, Norwalk, CT). The reaction buffer consists of 10 
mM Tris-HCl, pH 83, 50 mM KC1, 1 .5 mM MgCl 2 , and 0.01% (wt/vol) gelatin. PCR is 
performed using thirty cycles. 

5 Sequencing reactions generated by the Sanger method can also be analyzed with the aid of 

enzymatic labeling. Each chain-terminated fragment is labeled with a 5' and 3' fluorophore. 
The 5' fluorophore is incorporated as part of the primer and the 3' end-label is added by terminal 
transferase. Terminal transferase labeling was originally developed using radiolabeled 
nucleoside triphosphates (Roychoudhury et ah, 1979; Tu and Cohen, 1980). The technique was 

10 later applied to non-radioactive nucleotide analogs (Kumar et al., 1 988). The technique involves 
addition of the target DNA to (a) 20 |il of 0.5 M potassium cacodylate, 5 mM CoCU, 1 mM 
DTT, pH 7, (b) lOOjaM of a modified deoxynucleoside triphosphate, 4^1 of 5 mM dCTP, and 100 
fil of water. Terminal transferase is added to a final concentration of 50 units in the reaction 
mixture and reacted for 45 minutes at 37°C (Hermanson, 1996). 

15 A post-labeling procedure can be used as well. In this method, a base analog with a linker 

arm for attaching a fluorophore is initially incorporated into the DNA using the methods 
described above. Subsequent attachment of fluorophores to the linker arms is possible using 
covalent linking techniques such as those described by Waggoner (1995). Jett et al. (1995) has 
demonstrated the full length synthesis of M13 DNA (7250 bp) with complete incorporation of 

20 one of the following: 5-(3-aminopropynyl)-dCTP, 5-(3-aminoallyl)-dCTP, or 5-(3-aminoallyl)- 
dUTP. Covalent fluorophore attachment to the linkers was also shown. The advantage of this 
technique is that the label need not consist of individual fluorescent molecules, but could be 
larger molecules with greater number of dyes. Such examples include phycobiliproteins, dye- 
filled beads, tagged proteins, or chains of fluorescent tags. 

25 Each of the foregoing patents, patent applications and references is herein incorporated by 

reference in its entirety. Having described the presently preferred embodiments in accordance 
with the present invention, it is believed that other modifications, variations and changes will be 
suggested to those skilled in the art in view of the teachings set forth herein. It is, therefore, to be 
understood that all such variations, modifications, and changes are believed to fall within the 

30 scope of the present invention as defined by the appended claims. 

What I claim is: 
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Claims 



1 . A method for analyzing a polymer of linked units comprising 

exposing a plurality of individual units of a polymer to an agent selected from the group 
5 consisting of electromagnetic radiation, a quenching source and a fluorescence excitation source, 
individual units interacting with the agent to produce a detectable signal, and 

detecting sequentially the signals resulting from said interaction to analyze the polymer. 



2. The method of claim 1, wherein the signal is electromagnetic radiation. 



10 



3. The method of claim 2, wherein the agent is electromagnetic radiation. 



4. The method of claim 3, wherein a portion of the plurality of individual units of the 
polymer is labeled with a fluorophore. 

15 

5. The method of claim 3, wherein the plurality of individual units of the polymer are 
sequentially exposed to electromagnetic radiation by bringing the plurality of individual units in 
proximity to a light emissive compound and exposing the light emissive compound to 
electromagnetic radiation, and wherein the plurality of individual units of the polymer detectably 

20 affect emission of electromagnetic radiation from the light emissive compound. 



6. The method of claim 3 wherein the plurality of individual units of the polymer are 
sequentially exposed to electromagnetic radiation, and wherein the electromagnetic radiation 
detectably affects emission of electromagnetic radiation from the plurality of individual units of 
25 the polymer to produce the detectable signal. 
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7. The method of claim 5, wherein the individual units detectably affecting emission of 
electromagnetic radiation from the light emissive compound are labeled with a fluorophore. 



8. The method of claim 5, wherein light emissive compound is attached to a solid material. 

5 

9. The method of claim 3 wherein the plurality of individual units of the polymer are 
sequentially exposed to the agent by moving the polymer through a nanochannel in a wall 
material and exposing the plurality of individual units of the polymer to the agent at an 
interaction station at the nanochannel. 

10 

10. The method of claim 9, wherein the polymer is moved through a nanochannel in a wall 
material that has a light-emissive compound embedded in the wall material, adjacent the 
nanochannel, whereby the plurality of individual units interact with the light emissive compound 
as the polymer moves through the nanochannel. 

15 

1 1 . The method of claim 9, wherein said wall material comprises a plurality of nanochannels, 
an interaction station at the nanochannel, and further comprising moving a plurality of polymers 
through said nanochannel, only one polymer per nanochannel at any given time, and detecting 
simultaneously the signals resulting from the interaction of individual units of the polymers and 

20 the agent at said interaction station. 



12. The method of claim 1 further comprising storing in a database said signals which are 
detected. 



25 13. The method of claim 2 further comprising comparing the signals detected to a pattern of 
signals from another polymer to determine the relatedness of the two polymers. 
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14. The method of claim 2 further comprising comparing the detected signals to a known 
pattern of signals characteristic of a known polymer to determine the relatedness of the polymer 
being analyzed to the known polymer. 



5 15. The method of claim 2, wherein the plurality of individual units are two units, a first unit 
at a first end of the polymer and a second unit at an opposite second end of the polymer. 

16. The method of claim 1 5 further comprising measuring the length of time elapsed between 
detection of a first signal from the first unit and a second signal from the second unit. 

0 

17. The method of claim 2 further comprising detecting the time elapsed between the 
sequential detection of the signals. 



1 8. The method of claim 2, wherein a first of said individual units interacts with the agent to 
15 produce a first detectable signal and a second of said individual units interacts with the agent to 

produce a second detectable signal different from the first detectable signal. 

19. The method of claims 1-18, wherein the polymer is a nucleic acid. 

20 20. The method of claims 1-18, wherein only a portion of the units of the polymer are 
labeled. 

21. The method of claims 1-18, wherein only a portion of the units of the polymer are labeled 
and wherein the labeling is random. 

25 
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22. The methods of claims 1-18, wherein at least two units of the polymer are labeled 
differently so as to produce two different detectable signals. 

23. The method of claim 1 , wherein the plurality of individual units of the polymer are 

5 exposed to at least two agents, wherein the interaction between the units of the polymer and the 
at least two agents produces at least two signals. 

24. The method of claim 23, wherein the at least two agents are positioned in distinct regions 
of a channel through which the polymer passes. 

10 

25. The method of claim 23, wherein the at least two signals are different signals. 

26. The method of claim 23, wherein the at least two signals are the same signals. 

15 27. The method of claim 9, wherein the nanochanriel is fixed in the wall. 

28. The method of claim 1 , wherein the unit is exposed to the agent at a station and wherein 
the station is a non-ligand material. 

20 29. A method for analyzing a polymer of linked units comprising: 

moving a plurality of individual units of a polymer of linked units with respect to a 
station; and, 

detecting sequentially signals arising from a detectable physical change in the polymer or 
the station as individual units pass the station to analyze the polymer. 

25 
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30. The method of claim 29,wherein the station is an interaction station and wherein 
individual units are exposed at the interaction station to an agent that interacts with the individual 
unit to produce a detectable electromagnetic radiation signal characteristic of said interaction. 

5 31. The method of claim 29,wherein the station is a signal generation station and the 
characteristic signal produced is a polymer dependent impulse. 

32. A method for analyzing a polymer of linked units, comprising: 

exposing a plurality of individual units of a polymer to a station to produce a a non-ion 
10 conductance signal resulting from the exposure of the units of the polymer to the station, and 
wherein the station is attached to a wall material having a surface defining a channel. 

33. A method for identifying an individual unit of a polymer comprising 

transiently exposing the individual unit of the polymer to an agent selected from the 
15 group consisting of electromagnetic radiation, a quenching source and a fluorescence excitation 
source, the identity of the individual unit being unknown, to generate an interaction with a 
detectable electromagnetic radiation signal characteristic of said individual unit. 

detecting said signal, and 

distinguishing said signal from signals generated from adjacent signal generating units of 
20 the polymer as an indication of the identity of the individual unit. 

34. The method of claim 33, wherein the agent is one or more fluorophores and the 
individual unit is transiently exposed by positioning the individual unit within energy transfer 
proximity of the agent, and wherein said signal is detected by detecting fluorescence energy 

25 transfer between the agent and the individual unit. 

35. The method of claim 34, wherein the agent is at least three fluorophores. 
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36. The method of claims 33-34, wherein the individual unit of the polymer is exposed to the 
agent by positioning the individual unit at an interaction station comprising a nanochannel in a 
wall material. 

5 37. The method of claim 36, wherein the wall material comprises two layers, one of the 
layers being conductive and the other being nonconductive and wherein the nanochannel 
traverses both layers. 



38. A method for identifying an individual unit of a polymer comprising 

10 transiently moving the individual unit of the polymer relative to a station, the identity of 

the individual unit being unknown, 

detecting a signal arising from a detectable physical change in the unit or the station, and 

distinguishing said signal from signals arising from exposure of adjacent signal 
generating units of the polymer to the station as an indication of the identity of the individual 
15 unit. 

39. The method of claim 38, wherein the station is an interaction station and wherein 
individual units are exposed at the interaction station to an agent that interacts with the individual 
unit to produce a detectable electromagnetic radiation signal characteristic of said interaction. 

20 

40. The method of claim 38, wherein the station is a signal generation station and the signal 
produced is a polymer dependent impulse. 

41 . A method for determining the proximity of two individual units of a polymer of linked 
25 units, comprising: 

moving the polymer relative to a station, 
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exposing individual units to the station to produce a characteristic signal arising from a 
detectable physical change in the unit or the station, detecting characteristic signals generated, 
and 

measuring the amount of time elapsed between detecting characteristic signals, the 
5 amount of time elapsed being indicative of the proximity of the two individual units. 

42. The method of claim 41 , wherein the station is an interaction station and wherein 
individual units are exposed at the interaction station to an agent that interacts with the individual 
unit to produce a detectable electromagnetic radiation signal characteristic of said interaction. 

10 

43. The method of claim 42, wherein the agent is selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source. 

44. The method of claim 41 , wherein the station is a signal generation station and the signal 
15 produced is a polymer dependent impulse. 

45. The method of claims 41-44, wherein the polymer is a nucleic acid. 

46. The method of claims 41-43, wherein the individual units of the polymer are labeled with 
20 a tluorophore. 

47. The method of claims 41-46, wherein the two individual units are randomly labeled 
individual units of the polymer. 

25 4K. The method of claims 42-43, wherein the interaction station comprises a nanochannel in 
a wall material. 
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49. A method for determining the order of two individual units of a polymer of linked units 
comprising: 

moving the polymer linearly with respect to a station, 

5 exposing one of the individual units to the station to produce a signal arising from a 

detectable physical change in the unit or the station, 

exposing the other of the individual units to the station to produce a second detectable 
signal arising from a detectable physical change in the unit or the station, different from the first 
signal; and 

10 determining the order of the signals as an indication of the order of the two individual units. 

50. The method of claim 49, wherein the station is an interaction station and wherein one of 
the individual units is exposed at the interaction station to an agent that interacts with the one 
individual unit to produce a first detectable electromagnetic radiation signal, and wherein the 

15 other of the individual units is exposed at the interaction station to an agent that interacts with the 
other of the individual units to produce a second detectable electromagnetic radiation signal. 

51. The method of claim 50, wherein the agent is selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source. 

20 

52. The method of claim 49, wherein the station is a signal generation station and the signal 
produced is a polymer dependent impulse. 

53. The method of claims 49-52, wherein the polymer is a nucleic acid. 

25 

54. The method of claims 49-5 1 , wherein the individual units of the polymer are labeled with 
a fluorophore. 
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55. The method of claims 49-52, wherein the two individual units are randomly labeled 
individual units of the polymer. 

56. The method of claims 50-51, wherein the interaction station comprises a nanochannel in 
5 a wall material. 



57. A method for determining the distance between two individual units of a polymer of 
linked units comprising: 



(1 ) causing the polymer to pass linearly relative to a station, 

10 (2) detecting a characteristic signal generated as each of the two individual units 

passes by the station, 

(3) measuring the time elapsed between the signals measured, 

(4) repeating steps 1 , 2 and 3 for a plurality of similar polymers to produce a data 
set, and 

15 (5) determining the distance between the two individual units based upon the 

information obtained from said plurality of similar polymers by analyzing the 
data set. 



58. The method of claim 57, wherein the station is a signal generation station. 

20 

59. The method of claim 58, wherein the characteristic signal is a polymer dependent 
impulse. 

60. The method of claim 57, wherein step (2) comprises measuring an electromagnetic 
25 radiation signal generated. 
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6 1 . The method of claim 57, wherein the plurality of similar polymers is a homogeneous 
population. 

62. The method of claim 57, wherein the plurality of similar polymers is a heterogenous 
5 population. 

63. The method of claim 57, wherein the plurality of similar polymers is randomly labeled. 

64. The method of claims 57-63, wherein the polymer is a nucleic acid. 

10 

65. The method of claim 58, wherein the two linked units are detected at the signal 
generation station by measuring light emission at the station. 

66. The method of claim 57, wherein the steps (1 ) - (4) are carried out substantially 
15 simultaneously. 

67. The method of claim 57, wherein the station is an interaction station and the interaction 
station is a nanochannel. 

20 68. An article of manufacture, comprising: 

a wall material having a surface defining a channel, 

an agent wherein the agent is selected from the group consisting of an electromagnetic 
radiation source, a quenching source, a luminescent film layer and a fluorescence excitation 
source, attached to the wall material adjacent to the channel, wherein the agent is close enough to 
25 the channel and is present in an amount sufficient to detectably interact with a partner compound 
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selected from the group consisting of a light emissive compound, a light accepting compound, 
radiative compound, and a quencher passing through the channel. 

69. The article of claim 68, wherein the agent is an electromagnetic radiation source and 
5 wherein the electromagnetic radiation source is a light emissive compound. 

70. The article of claim 69, wherein the channel is a microchannel. 

71 . The article of claim 69, wherein the channel is a nanochannel. 

10 

72. The article of claim 69, wherein the surface of the wall material defining the 
microchannel is free of the light emissive compound. 

73. The article of claim 69, wherein the light emissive compound is attached to an external 
15 surface of the wall material. 

74. The article of claim 73, wherein the light emissive compound is attached to a linker 
which is attached to the external surface of the wall material. 

20 75. The article of claim 73, wherein the light emissive compound is concentrated at a region 
of the external surface of the wall material that surrounds a portion of the channel. 

76. The article of claim 73, further comprising a masking layer having openings which allow 
exposure of only localized areas of the light emissive compound. 

25 
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77. The article of claim 69, further comprising a second light emissive compound different 
from the first attached to the wall material adjacent to the channel, wherein the light emissive 
compound is close enough to the channel and is present in an amount effective to detectably 
interact with a partner light emissive compound passing through the channel. 

5 

78. The article of claims 73-76, wherein the external surface of the wall material adjacent to 
the light emissive compound is a conducting layer. 

79. The article of claim 78, wherein the wall material comprises two layers, the conducting 
10 layer and a nonconducting layer. 

80. The article of claims 73-76, wherein the wall material comprises at least two layers, a 
first layer preventing signal generation and a second layer allowing signal generation. 

15 8 ! . The article of claims 73-76, wherein the external surface of the wall material adjacent to 
the light emissive compound is a light impermeable layer. 

82. The article of claim 80, wherein the wall material comprises two layers, the light 
impermeable layer and a support light permeable layer. 

20 

83. The article of claim 69, wherein the light emissive compound is embedded in the wall 
material. 

84. The article of claim 83, wherein the light emissive compound is concentrated at a region 
25 of the wall material that surrounds a portion of the channel. 
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85. The article of claim 84, wherein the light emissive compound forms a concentric ring in 
the wall material around a portion of the channel. 

86. The article of claim 83, further comprising a second light emissive compound different 
5 from the first attached to the wall material adjacent to the channel, wherein the light emissive 

compound is close enough to the channel and is present in an amount effective to detectably 
interact with a partner light emissive compound passing through the channel. 

87. The article of claim 86, wherein the second light emissive compound is embedded in the 
10 wall material. 

88. The article of claims 83-87, wherein the wall material comprises a first conducting layer 
adjacent to a first side of the light emissive compound. 

15 89. The article of claim 88, wherein the wall material comprises a second conducting layer 
adjacent to a second side of the light emissive compound, the first and second layers 
sandwiching the light emissive compound. 

90. The article of claims 83-87, wherein the wall material comprises a nonconducting 

20 material and wherein the light emissive compound is.ernbedded in the nonconducting material. 

91 . The article of claims 83-87, wherein the wall material comprises a first light impermeable 
layer on a first side of the light emissive compound. 

25 92. The article of claim 91 , wherein the wall material comprises a second light impermeable 
layer on a second side of the light emissive compound, the first and second layers sandwiching 
the light emissive compound. 
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93. The article of claims 83-87, wherein the wall material is a light permeable material. 

94. The article of claim 93, wherein the light emissive compound is embedded in the light 
permeable material. 

5 

95. The article of claims 69-86, wherein the light emissive compound is embedded in a layer 
of a light permeable material. 

96. The article of claim 89, wherein the light emissive compound is embedded in a layer of a 
10 light permeable material. 

97. The article of claim 68, wherein the light emissive compound is a fluorescent compound. 

98. The article of claim 69, wherein a length of the nanochannel is between 500 Angstroms 
15 and 1 mm. 

99. The article of claim 68, wherein a width of the channel is between 1 and 500 Angstroms. 

100. The article of claim 68, wherein the wall material is formed of two layers, a first light 
20 impermeable layer and a luminescent film layer attached to one another, wherein the channel 

extends through both layers and is defined by surfaces of both layers. 

101. The article of claim 1 00, wherein the channel is a nanochannel. 

25 1 02. The article of claim 101, further comprising a second light impermeable layer, the 
luminescent film layer positioned between the first and second light impermeable layers. 



BNSDOCiD: <WO 9S3S012A2 I > 



WO 98/35012 PCT/US98/03024 

-151- 

103. The article of claims 100-102, wherein the surface defining the channel includes a 
surface of the light impermeable layer which is free of luminescent film layer material. 

104. The article of claims 1 00-103, wherein the length of the channel is between 500 
5 Angstroms and 1 mm. 

105. The article of claim 68, wherein the agent is a fluorescence excitation source and wherein 
the fluorescence excitation source is a scintillation layer. 

10 106. The article of claim 105, wherein the scintillation layer is selected from the group 
consisting of Nal(TI), ZnS(Ag), anthracene, stilbene, and plastic phosphors. 

107. The article of claim 105, wherein the scintillation layer is embedded in the wall material 
between two radiation impermeable layers. 

15 

1 08. An article of manufacture, comprising: 

a wall material having a surface defining a plurality of channels. 

a station attached to a discrete region of the wall material adjacent to at least one of the 
channels, wherein the station is close enough to the channel and is present in an amount 
20 sufficient to cause a signal to arise from a detectable physical change in a polymer of linked units 
passing through the channel or in the station as the polymer is exposed to the station. 

109. The article of claim 108, wherein the station is an interaction station and wherein the 
interaction station is an electromagnetic radiation source. 

25 
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1 1 0. The article of claim 108, wherein the wall material is formed of two layers, a first light 
impermeable layer and a luminescent film layer attached to one another, wherein the channel 
extends through both layers and is defined by surfaces of both layers. 



5 111. The article of claims 108, wherein the station is a signal generation station and wherein 
the signal generation station produces a polymer dependent impulse. 



112. The article of claim 108, wherein the station is an electromagnetic radiation source and 
wherein the electromagnetic radiation source is a scintillation film. 

10 

113. The article of claim 112, wherein the scintillation film is embedded in the wall material 
between two radiation impermeable layers. 

1 14. The article of claims 1 09-1 1 3, wherein the channel is a microchannel. 

15 

115. The article of claims 1 09-1 1 3, wherein the channel is a nanochannel. 

116. An article of manufacture, comprising: 

a wall material having a surface defining a channel, 

20 a plurality of stations each attached to a discrete region of the wall material adjacent to 

the channel, wherein the stations are close enough to the channel and are present in an amount 
sufficient to cause a signal to arise from a detectable physical change in a polymer of linked units 
passing through the channel or in the station as the polymer is exposed to the station. 



25 117. The article of claim 116, wherein the stations are interaction stations. 
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1 1 8. The article of claims 1 16, wherein the stations are signal generation stations. 

1 19. The article of claim 1 1 6, wherein the station is an electromagnetic radiation source and 
wherein the electromagnetic radiation source is a scintillation film. 

5 

120. The article of claims 117-119, wherein the channel is a microchannel. 

121 . The article of claims 117-119, wherein the channel is a nanochannel. 



A method for analyzing a polymer of linked units comprising: > 

( 1 ) providing a labeled polymer of linked units 

(2) detecting sequentially signals from unit specific markers of less than all of 
the linked units, and 

(3) storing a signature of said signals detected to analyze the polymer. 
The method of claim 122, wherein the signals arc detected linearly. 



10 122. 



15 

123. 



124. The method of claim 122, wherein the signature of signals is at least 10 signals. 

20 125. The method of claim 122, wherein the signature of signals defines the order of unit 
specific markers. 



126. The method of claim 122, wherein the signature of signals defines the distance between 
unit specific markers. 

25 
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127. The method of claim 122, wherein the signature of signals defines the number of unit 
specific markers. 

128. The method of claim 122, wherein all of the unit specific markers are detected. 

5 

129. The method of claim 122, wherein only a portion of the unit specific markers are 
detected. 

130. The method of claim 122, wherein the polymer is partially and randomly labeled with a 
10 unit specific marker. 

131 . The method of claim 129, wherein all of the units of the polymer are labeled with a unit 
specific marker. 

15 132. The method of claim 122, wherein the labeled polymer of linked units is exposed to an 
agent selected from the group consisting of electromagnetic radiation, a quenching source and a 
fluorescence excitation source and wherein the signals are produced by the interaction between a 
unit specific marker of the polymer and the agent. 

20 133. The method of claim 122, wherein the labeled polymer of linked units is moved with 

respect to a station and wherein the signals are generated upon exposure of a unit specific marker 
of the polymer to the station. 

1 34. The method of claim 122, wherein the method is a method for identifying a unit specific 
25 marker of the polymer, the identity of the unit specific marker being indicative of the identity of 
at least one unit of the polymer, and wherein the unit specific marker is transiently exposed to a 
station to produce signals characteristic of said unit specific marker, and further comprising the 
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step of distinguishing said signal from signals generated from adjacent signal generating unit 
specific markers of the polymer as an indication of the identity of the unit specific marker. 

135. The method of claim 134, wherein the station is an agent selected from the group 

5 consisting of electromagnetic radiation, a quenching source and a fluorescence excitation source 
and wherein the signals are detectable electromagnetic radiation signals. 

136. The method of claim 122, wherein the method is a method for determining the proximity 
of l wo unit specific markers of the polymer wherein the proximity of the two unit specific 

10 markers is the signature of said signals, the identity of each unit specific marker being indicative 
of the identity of at least one unit of the polymer, wherein the labeled polymer is moved relative 
to a station to expose the two unit specific markers to the station to produce a characteristic 
signal arising from a detectable physical change in the unit specific marker or the station, and 
further comprising the step of measuring the amount of time elapsed between detecting each 

15 characteristic signal, the amount of time elapsed being indicative of the proximity of the two unit 
specific markers. 

1 37. The method of claim 122, wherein the method is a method for determining the order of 
two sequence unit markers of the polymer, the identity of each unit specific marker being 

20 indicative of the identity of at least one unit of the polymer wherein the order of the two unit 
specific markers is the signature of said signals, wherein the labeled polymer is moved linearly 
with respect to a station, to expose one of the unit specific markers to the station to produce a 
signal which is a signature of signals, and to expose the other of the unit specific markers to the 
station to produce a second detectable which is a signature of said signals, different from the first 

25 signal, and further comprising the step of determining the order of the signals as an indication of 
the order of the two unit specific markers. 

1 38. The method of claim 122, wherein the method is a method for determining the distance 
between two unit specific markers of the polymer, the identity of each unit specific marker being 
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indicative of the identity of at least one unit of the polymer wherein the distance between two 
unit specific markers is the signature of said signals, wherein the labeled polymer is moved 
linearly relative to a station to produce a characteristic signal generated as each of the two unit 
specific markers passes by the station, and further comprising the step of determining the 
5 distance between the signals as an indication of the distance between the two unit specific 
markers. 

139. A method for preparing a wall material comprising covalently bonding light emissive 
compounds or quenching compounds to a plurality of discrete locations of a wall material, each 

10 of said discrete locations close enough to a respective interaction station on said wall material, 
whereby when an individual unit of a polymer, which is interactive with said light emissive 
■' compound or quenching compound to produce a signal, is positioned at said interaction station, 
the light emissive compound or the quenching compound interacts with the individual unit to 
produce said signal. 

15 

140. The method of claim 139, wherein the light emissive compounds or quenching 
compounds are covalently bonded at discrete locations close to channels in said wall material, 
said channels defining interaction stations. 

20 141 . The method of claim 140, wherein the channels are nanochannels. 

142. The method of claim 140, wherein the channels are microchannels. 

143. The method of claim 140, wherein the light emissive compounds or quenching 

25 compounds are covalently bonded to the wall material in a manner whereby the surfaces of the 
wall material defining the channel are free of the light emissive compounds and quenching 
compounds. 
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144. The method of claim 141, wherein the light emissive compounds or quenching 
compounds are covalently bonded to the wall material in a manner whereby the surfaces of the 
wall material defining the channel are free of the light emissive compounds and quenching 
compounds. 

5 

145. The method of claims 141-144 further comprising applying a layer of conductive 
material to said wall material. 

146. A method for attaching a chemical substance selectively at a rim of a channel through a 
10 : .wall material that is opaque comprising: 

providing a wall material with photoprotective chemical groups attached at the rim of the 
channel through the wall material, 

applying light to the photoprotective chemical groups to dephotoprotect the chemical groups, 

and 

15 attaching the chemical substance to the deprotected chemical groups. 

147. The method of claim 146, wherein the light is applied to only selected regions of a 
surface of the wall material defining the rim of the channel. 

20 1 48. The method of claim 146, wherein the channel has a first end and a second end, the rim 
being at the first end, and wherein the light is applied to the second end, the light passing through 
the channel to contact the photoprotected chemical groups at the rim of said first end. 

149. The method of claims 146-148, wherein the channel is a microchannel. 

25 

150. The method of claims 146-148, wherein the channel is a nanochannel. 
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151. A method for preparing a wall material having localized areas of light emission on a 
surface of the wall material comprising: 

providing a wall material having a surface, and 

applying a light emissive compound to the surface to produce at least localized areas of light 
5 emission on the surface, wherein the localized areas define a target region for detecting light 
emission, and wherein the target region is a rim of a channel through the wall material. 

152. The method of claim 151, further comprising the steps of attaching a photoprotective 
chemical group to the surface of the wall material, applying light to the photoprotective chemical 

lo groups to dephotoprotect the chemical groups prior to attaching the light emissive compound, 
and attaching the light emissive compound to the dephotoprotected chemical groups. 

153. The method of claim 152, wherein the light is applied to only selected regions of the 
surface of the wall material defining the rim of the channel. 

15 

1 54. The method of claim 1 52, wherein the photoprotective chemical group is attached to only 
selected regions of the surface of the wall material defining the rim of the channel. 

155. The method of claim 152, wherein the channel has a first end and a second end, the rim 
20 being at the first end, and wherein the light is applied, to the second end, the light passing through 

the channel to contact the photoprotected chemical groups at the rim of said first end. 

1 5<>. The method of claims 152-1 55, wherein the channel is a microchannel. 
25 1 57. The method of claims 152-155, wherein the channel is a nanochannel. 
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158. The method of claim 151, further comprising the step of positioning a mask having 
openings over the surface of the wall material such that only localized areas of light emission are 
exposed through the openings of the mask. 



5 1 59. The method of claim 1 58, wherein the light emissive compound is attached to a portion 
of the surface of the wall material. 

1 60. A method for detecting resonance energy transfer or quenching between two interactive 
partners capable of such transfer or quenching comprising 

bringing the two partners in close enough proximity to permit such transfer or quenching; 

' v 

applying an agent to one of said partners, the agent selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source, 

shielding fluorescence resonance energy transfer and quenching occurring from 
electromagnetic radiation emission and interaction between said partners with a material shield, 
and 

detecting the emitted electromagnetic radiation. 

161. An apparatus for detecting a signal comprising, 

a housing with a buffer chamber 

20 a wall defining a portion of the buffer chamber, and having a plurality of openings for 

aligning polymers, 

a sensor fixed relative to the housing, the sensor distinguishing the signals emitted at 
each opening from the signals emitted at the other of the openings to generate opening dependent 
sensor signals, and 

25 a memory for collecting and storing said sensor signals. 



10 



15 
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162. The apparatus of claim 161, wherein the sensor is an optical sensor and the optical sensor 
senses electromagnetic radiation signals emitted at the plurality of openings. 

163. The apparatus of claim 161, further comprising a microprocessor. 

5 

164. The apparatus of claim 161, wherein the openings are defined by microchannels in the 
wall. 

165. The apparatus of claim 161, wherein the openings are defined by nanochannels in the 
10 wall. 



166. The apparatus of claims 161-165, wherein the plurality is at least 50. 



167. The apparatus of claims 161-165, wherein the apparatus further comprises a second 
15 buffer chamber separated from said first buffer chamber, by said wall, and wherein the buffer 
chambers are in fluid communications with one another via the openings. 



168. The apparatus of claim 166, wherein the apparatus further comprises a second buffer 
chamber separated from said first buffer chamber, by said wall, and wherein the buffer chambers 
20 are in fluid communications with one another via the openings. 



169. The apparatus of claim 167 further comprising a pair of electrodes secured to the 
housing, one of said pair positioned in the first buffer chamber and the other of the pair 
positioned in the second buffer chamber. 

25 

1 70. An apparatus for detecting a signal comprising 
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a housing defining a first buffer chamber and a second buffer chamber, 

a wall supported by the housing and separating the first and second buffer chambers, 

a plurality of channels defined by the wall and providing fluid communications between the 
first and second buffer chambers, and 

5 a sensor for distinguishing and collecting channel dependent signals. 

171 . The apparatus of claim 170, wherein the channels are nanochannels. 

1 72. The apparatus of claim 170, wherein the channels are microchannels. 

10 

173. The apparatus of claim 170, wherein an agent selected from the group consisting of 
electromagnetic radiation, a quenching source and a fluorescence excitation source is attached to 
the wall. 

15 174. The apparatus of claim 173, wherein the agent is electromagnetic radiation and wherein 
the electromagnetic radiation is a light emissive compound 

175. The apparatus of claim 174, wherein the light emissive compound is concentrated at the 
channels in the wall. 

20 

176. The apparatus of claims 170-175 further comprising means for moving biological entities 
through the channels. 

177. A method for characterizing a test polymer comprising, 

25 obtaining polymer dependent impulses for a plurality of polymers, 

comparing the polymer dependent impulses of the plurality of polymers, 
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determining the relatedness of the polymers based upon similarities between the polymer 
dependent impulses of the polymers, and 

characterizing the test polymer based upon the polymer dependent impulses of related 
polymers. 

5 

178. The method of claim 177, wherein the plurality of polymers is a homogenous population. 

1 79. The method of claim 1 77, wherein the plurality of polymers is a heterogeneous 
population. 

10 

1 80. The method of claim 1 77, wherein the polymer is randomly labeled. 

181. The method of claim 1 77, wherein the polymer is a polymer of at least two different 
linked units, and wherein said at least two different linked units are labeled to produce different 

15 signals. 

182. The method of claims 1 77- 1 8 1 , wherein the polymer is a nucleic acid. 

183. The method of claim 1 82, wherein the obtained polymer dependent impulses include an 
2u order of polymer dependent impulses. 

1 84. The method of claim 1 82, wherein the obtained polymer dependent impulses includes the 
time of separation between specific signals. 

25 1 85. The method of claim 1 82, wherein the polymer dependent impulses are obtained by 
moving the plurality of polymers lineally past a signal generation station. 
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186. The method of claim 182, wherein the obtained polymer dependent impulses include a 
number of polymer dependent impulses. 



1 87. A method for sequencing a polymer of linked units comprising, 

5 obtaining polymer dependent impulses from a plurality of overlapping polymers, at least 

a portion of each of the polymers having a sequence of linked units identical to the other of the 
polymers, and 

comparing the polymer dependent impulses to obtain a sequence of linked units which is 
identical in the plurality of polymers. 

10 

188. The method of claim 187, wherein the polymer dependent impulses are optically 
detectable. 

1 89. The method of claim 1 87, wherein the plurality of polymers is a homogeneous 
15 population. 

190. The method of claim 187, wherein the plurality of polymers is a heterogeneous 
population. 

20 191. The method of claim 187, wherein the plurality of polymers is randomly labeled. 

192. The method of claims 187-191, wherein the polymers are nucleic acids. 

193. The method of claim 192, wherein the nucleic acids are labeled with an agent selected 
25 from the group consisting of an electromagnetic radiation source, a quenching source and a 

fluorescence excitation source. 
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194. A method for labeling nucleic acids comprising, 
contacting a dividing cell with a nucleotide analog, 

isolating from the cell nucleic acids that have incorporated the nucleotide analog, and 

modifying the nucleic acid with incorporated nucleotide analog by labeling the incorporated 
5 nucleotide analog. 

195. The method of claim 194, wherein the nucleotide analog is a brominated analog. 

196. The method of claim 194, wherein the dividing cell is contacted with a nucleotide analog 
10 by 

growth arresting the cell in the cell division cycle, 
performing the contacting step, and 
allowing the cell to reenter the cell division cycle. 

15 197. The method of claim 194, wherein the nucleic acids are isolated after the cells have 
reentered and completed the cell division cycle and before a second cell division cycle is 
completed. 

198. The method of claims 194-197, wherein the incorporated nucleotide analog is labeled 
20 with an agent selected from the group consisting of an electromagnetic radiation source, a 

quenching source, a fluorescence excitation source, and a radiation source. 

1 99. A method for determining the order of units of a polymer of linked units comprising: 

(1 ) moving the polymer linearly relative to a station, 

25 (2) measuring a polymer dependent impulse generated as each of two individual 

units, each giving rise to a characteristic signal, pass by the station, 
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(3) 



repeating steps 1 and 2 for a plurality of similar polymers, and 



information 



(4) 



determining the order of at least the two individual units based upon the 
obtained from said plurality of similar polymers. 



5 200. The method of claim 199, wherein the station is a signal generation station. 



201 . The method of claim 199, wherein the station is an interaction station. 



202. The method of claim 199, wherein step (2) comprises measuring an electromagnetic 
10 radiation signal generated. 



203. The method of claim 199, wherein the plurality of similar polymers is a homogeneous 



15 204. The method of claim 199, wherein the plurality of similar polymers is a heterogenous 
population. 

205. The method of claims 199, wherein the polymer is a nucleic acid. 

20 206. A method for analyzing a set of polymers, each polymer of said set being an individual 
polymer of linked units comprising: 

orienting the set of polymers parallel to one another, and 

detecting a polymer specific feature of said polymers. 

25 207. The method of claim 206, wherein the polymers are oriented by applying an electric field 
to said polymers. 
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208. The method of claim 206, wherein the polymer specific feature is an order of linked unity 
in the polymers. 



209. The method of claim 206, wherein the detecting step is performed simultaneously for 
said polymers. 



2 1 0. The method of claim 206, wherein the detection step comprises measuring 
electromagnetic radiation signals. 



211. The method of claim 206, wherein the detection step comprises causing the polymers to 
pass linearly relative to a plurality of signal generation stations, and detecting and distinguishing 
signals generated as said polymers pass said interaction stations. 



2 1 2. The method of claim 206-2 1 1 wherein the polymers are a homogenous population. 

213. The method of claims 206-2 1 1 wherein the polymers are a heterogenous population. 

214. The method of claims 206-21 1 wherein the polymers are randomly labeled. 

215. The method of claims 206-2 1 1 wherein the orientation step is in a solution free of gel. 

216. A method for analyzing a set of polymers, each polymer of the set being an individual 
polymer of linked units, comprising: 

orienting the set of polymers in an electric field, 

simultaneously moving the set of polymers through defined respective channels, and 
detecting a polymer specific feature as the polymers are moved through the channels. 



9835012A2 I > 



WO 98/35012 PCT/US98/03024 

- 167- 

217. The method of claim 216 wherein the channels are nanochannels. 

218. The method of claim 2 1 6, wherein the polymer specific feature is an order of linked unity 
in the polymers. 

219. The method of claim 216, wherein the detecting step is performed simultaneously for 
said polymers. 



220. The method of claim 216, wherein the detection step comprises measuring 
10 electromagnetic radiation signals. 

221. The method of claim 2 1 6, wherein the detection step comprises causing the polymers to 
pass linearly relative to a plurality of signal generation stations, and detecting and distinguishing 
polymer dependent impulses generated as said polymers pass said signal generation stations. 

15 

222. The method of claims 216-221 wherein the polymers are a homogenous population. 

223. The method of claims 216-221 wherein the polymers are a heterogenous population. 
2o 224. The method of claims 21 6-221 wherein the polymers are randomly labeled. 

225. The method of claims 216-221 wherein the orientation step is in a solution free of gel. 



226. An apparatus for detecting optically a plurality of signals comprising: 
a housing with a buffer chamber, 
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a wall material defining a portion of the buffer chamber, the wall including polymer 
interaction stations, and 

an optical sensor secured to the housing, the optical sensor constructed and arranged to 
detect electromagnetic radiation signals emitted at the interaction stations. 

5 

227. A computer system for making characteristic information of a plurality of polymers of 
linked units available in response to a request, comprising: 

a memory for storing, for each of the plurality of the polymers and in a manner accessible 
using a unique identifier for the polymer, records including information indicative of sequentially 
10 detected signals arising from a detectable physical change in the plurality of individual units of 
the polymer or a station to which the polymer is exposed; and 

a processor for accessing the records stored in the memory for a selected one of the 
plurality of the polymers according to a unique identifier associated with the selected polymer. 

15 228. The system of claim 227, wherein the sequentially detected signals arise from an 

interaction of the plurality of individual units of the polymer exposed to an agent selected from 
the group consisting of electromagnetic radiation, a quenching source and a fluorescence 
excitation source 

20 229. The computer system of claim 227, further comprising: 

means for comparing the sequentially detected signals of the selected polymer to a known pattern 
of signals characteristic of a known polymer to determine relatedness of the selected polymer to 
the known polymer. 
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